ReflexioDeveloper Docs
Menu
OSS

embeddings

Run the local OpenAI-compatible embedding daemon.

reflexio embeddings

Self-hosted open-source only. The embedding daemon serves embeddings from a local model on the machine where the CLI runs.

The daemon hosts a single active local embedding model (default local/nomic-embed-text-v1.5) behind an OpenAI-compatible API. services start launches it automatically when local embedding mode is selected (REFLEXIO_EMBEDDING_PROVIDER=local_service or CLAUDE_SMART_USE_LOCAL_EMBEDDING=1); run it directly only when you want the embedding service on its own.


Command: reflexio embeddings serve

Serves the embedding daemon with uvicorn and blocks until you interrupt with Ctrl-C.

reflexio embeddings serve [OPTIONS]

Options

Prop

Type

Endpoints

  • GET /health — liveness check; reports the currently active model.
  • POST /v1/embeddings — OpenAI-compatible embedding endpoint. The model field is required and must match the daemon's single active model; requests for a different model are rejected.

Examples

# Serve on the default port (EMBEDDING_PORT or 8072)
reflexio embeddings serve

# Serve on a custom port
reflexio embeddings serve --port 9072

# Check the daemon and its active model
curl -H "User-Agent: my-agent-reflexio" http://localhost:8072/health

See services for starting the embedding daemon alongside the backend, and Configuration for embedding provider settings.