Available Models
All models accessible through a single API endpoint. Pricing is per 1M tokens.
List Models API
GET /v1/models Authorization: Bearer sk-bc-YOUR_API_KEY
OpenAI
| Model ID | Input | Output | Context | Description |
|---|---|---|---|---|
| gpt-4o | $2.50 | $10.00 | 128K | Flagship multimodal model — best for complex tasks |
| gpt-4o-mini | $0.15 | $0.60 | 128K | Fast and cost-effective for most tasks |
| gpt-4.1 | $2.00 | $8.00 | 1M | Latest GPT model with improved coding and instruction following |
| gpt-4.1-mini | $0.40 | $1.60 | 1M | Compact version of GPT-4.1 |
| gpt-4.1-nano | $0.10 | $0.40 | 1M | Fastest and cheapest GPT-4.1 variant |
| o4-mini | $1.10 | $4.40 | 200K | Reasoning model for math, science, and coding |
Anthropic
| Model ID | Input | Output | Context | Description |
|---|---|---|---|---|
| claude-sonnet-4 | $3.00 | $15.00 | 200K | Best balance of intelligence and speed |
| claude-opus-4 | $15.00 | $75.00 | 200K | Most capable Claude model for complex analysis |
| claude-haiku-4 | $0.80 | $4.00 | 200K | Fast and affordable for high-volume tasks |
| claude-3-5-sonnet | $3.00 | $15.00 | 200K | Previous generation Sonnet |
| claude-3-5-haiku | $0.80 | $4.00 | 200K | Previous generation Haiku |
| Model ID | Input | Output | Context | Description |
|---|---|---|---|---|
| gemini-2.5-flash | $0.15 | $0.60 | 1M | Fast multimodal model with thinking capabilities |
| gemini-2.5-pro | $1.25 | $10.00 | 1M | Most capable Gemini model |
| gemini-2.0-flash | $0.10 | $0.40 | 1M | Previous generation Flash — ultra low cost |
Model Routing
The gateway automatically routes to the correct provider based on model name prefix:
gpt-*,o1-*,o3-*,o4-*— OpenAIclaude-*— Anthropicgemini-*— Google
Automatic Failover
If a provider returns a 5xx error or times out, the gateway automatically retries with a fallback provider. This happens transparently — you see a successful response from the fallback.