All endpoints are under http://localhost:8000 by default.
For full request/response schemas, see the OpenAPI spec or the interactive docs at /docs when the gateway is running.
Endpoint availability
Endpoint group | Standalone | Connected to Otari |
Health ( | Yes | Yes |
Chat completions ( | Yes | Yes |
All other | Yes | No |
Authentication
Standalone
Preferred header:
Otari-Key: Bearer <token>Back-compat headers:
AnyLLM-Key,X-AnyLLM-KeyAuthorization: Bearer <token>is also accepted
Regular API endpoints use an API key. Management endpoints use the master key.
Connected to Otari
POST /v1/chat/completionsexpectsAuthorization: Bearer <user-token>Otari-Keyand local API keys are not used for this path
Available in both deployment types
Health
No authentication required.
Method | Path | Description |
|
| General health check. Includes otari.ai reachability fields when connected. |
|
| Kubernetes liveness probe. |
|
| Kubernetes readiness probe. Checks DB (standalone) or otari.ai reachability. Returns 503 on failure. |
Chat completions
Method | Path | Description | Auth |
|
| OpenAI-compatible chat completions. Supports streaming and tool use ( | Standalone: API key or master key. Connected: |
Standalone-only endpoints
Messages
Method | Path | Description | Auth |
|
| Anthropic Messages API-compatible endpoint. Supports streaming and extended thinking. | API key or master key |
Responses
Method | Path | Description | Auth |
|
| OpenAI Responses API-compatible endpoint. Supports streaming. | API key or master key |
Embeddings
Method | Path | Description | Auth |
|
| Generate embeddings for text input. | API key or master key |
Models
Method | Path | Description | Auth |
|
| List available models from pricing entries. | API key or master key |
|
| Get a specific model. | API key or master key |
Moderations
Method | Path | Description | Auth |
|
| OpenAI-compatible content moderation. | API key or master key |
Rerank
Method | Path | Description | Auth |
|
| Reorder documents by relevance to a query. | API key or master key |
Images
Method | Path | Description | Auth |
|
| Generate images from text prompts. | API key or master key |
Audio
Method | Path | Description | Auth |
|
| Transcribe audio to text (multipart upload). | API key or master key |
|
| Generate speech from text (TTS). | API key or master key |
Batches
Method | Path | Description | Auth |
|
| Create an async batch of LLM requests. | API key or master key |
|
| List batches. Query param: | API key or master key |
|
| Get batch status. Query param: | API key or master key |
|
| Cancel a batch. Query param: | API key or master key |
|
| Get batch results. Returns 409 if not complete. Query param: | API key or master key |
Key management
Method | Path | Description | Auth |
|
| Create an API key. | Master key |
|
| List all API keys. | Master key |
|
| Get a specific key. | Master key |
|
| Update a key (name, active status, expiration, metadata). | Master key |
|
| Revoke a key. | Master key |
User management
Method | Path | Description | Auth |
|
| Create a user. | Master key |
|
| List users. | Master key |
|
| Get a specific user. | Master key |
|
| Update a user. | Master key |
|
| Soft-delete a user and deactivate their keys. | Master key |
|
| Get usage history for a user. | Master key |
Budget management
Method | Path | Description | Auth |
|
| Create a budget. | Master key |
|
| List budgets. | Master key |
|
| Get a specific budget. | Master key |
|
| Update a budget. | Master key |
|
| Delete a budget. | Master key |
Pricing
Method | Path | Description | Auth |
|
| Set or update model pricing. | Master key |
|
| List all model pricing. | API key or master key |
|
| Get effective pricing for a model. Optional | API key or master key |
|
| Get full pricing history for a model. | API key or master key |
|
| Delete a pricing entry. | Master key |
Usage
Method | Path | Description | Auth |
|
| List usage logs. Filters: | Master key |
Source: mozilla-ai/otari/docs/api-reference.md
