Available Models

All models are accessible through a single API endpoint. Pricing is per 1M tokens.

List Models API

GET /v1/models
Authorization: Bearer sk-bc-YOUR_API_KEY
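
As a minimal sketch, the request above could be issued from Python with only the standard library. The host in BASE_URL is a placeholder assumption (this page does not state the gateway's base URL), the helper names are hypothetical, and the shape of the JSON response is not documented here, so the result is returned undecoded beyond JSON parsing.

```python
import json
import urllib.request

# Assumption: placeholder host. Substitute your gateway's real base URL.
BASE_URL = "https://gateway.example.com"


def build_list_models_request(api_key, base_url=BASE_URL):
    """Build the GET /v1/models request with a Bearer token header."""
    return urllib.request.Request(
        url=f"{base_url}/v1/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def list_models(api_key, base_url=BASE_URL):
    """Send the request and parse the JSON body.

    The response schema is not specified on this page, so the parsed
    value is returned as-is.
    """
    request = build_list_models_request(api_key, base_url)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling list_models("sk-bc-YOUR_API_KEY") would perform the network request; build_list_models_request can be used on its own to inspect the URL and headers first.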

OpenAI

Model ID	Input	Output	Context	Description
gpt-4o	$2.50	$10.00	128K	Flagship multimodal model — best for complex tasks
gpt-4o-mini	$0.15	$0.60	128K	Fast and cost-effective for most tasks
gpt-4.1	$2.00	$8.00	1M	Latest GPT model with improved coding and instruction following
gpt-4.1-mini	$0.40	$1.60	1M	Compact version of GPT-4.1
gpt-4.1-nano	$0.10	$0.40	1M	Fastest and cheapest GPT-4.1 variant
o4-mini	$1.10	$4.40	200K	Reasoning model for math, science, and coding

Anthropic

Model ID	Input	Output	Context	Description
claude-sonnet-4	$3.00	$15.00	200K	Best balance of intelligence and speed
claude-opus-4	$15.00	$75.00	200K	Most capable Claude model for complex analysis
claude-haiku-4	$0.80	$4.00	200K	Fast and affordable for high-volume tasks
claude-3-5-sonnet	$3.00	$15.00	200K	Previous generation Sonnet
claude-3-5-haiku	$0.80	$4.00	200K	Previous generation Haiku

Google

Model ID	Input	Output	Context	Description
gemini-2.5-flash	$0.15	$0.60	1M	Fast multimodal model with thinking capabilities
gemini-2.5-pro	$1.25	$10.00	1M	Most capable Gemini model
gemini-2.0-flash	$0.10	$0.40	1M	Previous generation Flash — ultra low cost
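
Because prices are quoted per 1M tokens, the cost of a single request is the token count times the listed price, divided by one million. The sketch below illustrates that arithmetic with a few rows copied from the tables above; the PRICES dictionary and the estimate_cost helper are hypothetical names, and the sketch assumes billing is simply linear in input and output tokens.

```python
# Dollars per 1M tokens, copied from the tables above: (input, output).
PRICES = {
    "gpt-4o-mini": (0.15, 0.60),
    "claude-sonnet-4": (3.00, 15.00),
    "gemini-2.5-pro": (1.25, 10.00),
}


def estimate_cost(model_id, input_tokens, output_tokens):
    """Estimate the dollar cost of one request from its token counts."""
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


# Example: 10,000 input tokens and 2,000 output tokens on gpt-4o-mini
# come to roughly $0.0027 (0.0015 for input plus 0.0012 for output).
cost = estimate_cost("gpt-4o-mini", 10_000, 2_000)
print(f"${cost:.4f}")
```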

Model Routing

The gateway automatically routes to the correct provider based on model name prefix:

gpt-*, o1-*, o3-*, o4-* — OpenAI
claude-* — Anthropic
gemini-* — Google
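
The prefix rules above can be sketched as a simple lookup. This illustrates the matching logic only and is not the gateway's actual implementation; the provider_for name is hypothetical, and raising an error for an unrecognized prefix is an assumption, since this page does not say how unknown model names are handled.

```python
# Prefix-to-provider rules, taken from the list above.
PREFIX_TO_PROVIDER = (
    ("gpt-", "OpenAI"),
    ("o1-", "OpenAI"),
    ("o3-", "OpenAI"),
    ("o4-", "OpenAI"),
    ("claude-", "Anthropic"),
    ("gemini-", "Google"),
)


def provider_for(model_id):
    """Return the provider whose prefix matches the given model ID."""
    for prefix, provider in PREFIX_TO_PROVIDER:
        if model_id.startswith(prefix):
            return provider
    # Assumption: unknown prefixes are rejected rather than given a default.
    raise ValueError(f"no provider configured for model {model_id!r}")
```

For example, provider_for("o4-mini") resolves to "OpenAI" and provider_for("claude-sonnet-4") to "Anthropic".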

Automatic Failover

If a provider returns a 5xx error or times out, the gateway automatically retries the request with a fallback provider. Failover is transparent: when the fallback succeeds, you receive its response as an ordinary successful response.
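
The failover behavior can be pictured with the following sketch. It is an illustration under stated assumptions, not the gateway's code: providers are modeled as plain callables tried in order, ProviderError and call_with_failover are hypothetical names, and non-5xx errors are assumed to be returned to the caller without a retry, since only 5xx errors and timeouts are described as triggering failover.

```python
class ProviderError(Exception):
    """Hypothetical error carrying the HTTP status a provider returned."""

    def __init__(self, status):
        super().__init__(f"provider returned HTTP {status}")
        self.status = status


def call_with_failover(providers, request):
    """Try each provider callable in order, failing over on 5xx or timeout."""
    last_error = None
    for call in providers:
        try:
            return call(request)
        except TimeoutError as exc:
            last_error = exc  # timed out: try the next provider
        except ProviderError as exc:
            if not 500 <= exc.status < 600:
                raise  # assumption: non-5xx errors are not retried
            last_error = exc  # 5xx: try the next provider
    raise RuntimeError("all providers failed") from last_error
```

With a primary that raises ProviderError(503) and a fallback that succeeds, the caller sees only the fallback's return value, which mirrors the transparent behavior described above.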
