Reference

Models catalog

Every public alias you can pass as the model field. Prices are quoted in USD per 1 million tokens, the same unit OpenAI, Anthropic and Google publish on their pricing pages. Relay billing meters at per-token resolution; amounts are shown in USD and your wallet is debited in USD-cents.

44 models live. You can also browse this list in the developer console.

deepseek-v

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`deepseek-v4-flash`	💬 Text	$0.140	$0.280
`deepseek-v4-pro`	💬 Text	$0.440	$0.870

doubao

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`doubao-1.5-lite-32k`	💬 Text	$0.040	$0.080
`doubao-1.5-pro-32k`	💬 Text	$0.110	$0.280
`doubao-1.5-pro-32k-character`	💬 Text	$0.110	$0.280
`doubao-1.5-vision-pro`	💬 Text	$0.420	$1.25

doubao-embedding

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`doubao-embedding-vision`	🔢 Embeddings	$0.100	—

doubao-seed

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`doubao-seed-1.6`	💬 Text	$0.110	$1.11
`doubao-seed-1.6-flash`	💬 Text	$0.020	$0.210
`doubao-seed-1.6-vision`	💬 Text	$0.110	$1.13
`doubao-seed-1.8`	💬 Text	$0.110	$0.280
`doubao-seed-2.0-code`	💬 Text	$0.440	$2.22
`doubao-seed-2.0-lite`	💬 Text	$0.080	$0.500
`doubao-seed-2.0-mini`	💬 Text	$0.030	$0.280
`doubao-seed-2.0-pro`	💬 Text	$0.440	$2.22
`doubao-seed-2.1-pro`	💬 Text	$0.830	$4.17
`doubao-seed-2.1-turbo`	💬 Text	$0.420	$2.08
`doubao-seed-character`	💬 Text	$0.110	$0.280
`doubao-seed-code`	💬 Text	$0.170	$1.12
`doubao-seed-evolving`	💬 Text	$0.830	$4.17
`doubao-seed-translation`	💬 Text	$0.170	$0.500

doubao-seedance

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`doubao-seedance-1.0-pro`	🎬 Video	—	$2.08
`doubao-seedance-1.0-pro-fast`	🎬 Video	—	$0.580
`doubao-seedance-1.5-pro`	🎬 Video	—	$2.22
`doubao-seedance-2-0-pro`	🎬 Video	—	$6.40
`doubao-seedance-2.0-fast`	🎬 Video	—	$5.14

doubao-seedream

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`doubao-seedream-4.0`	🖼 Image	—	—
`doubao-seedream-4.5`	🖼 Image	—	—
`doubao-seedream-5.0-lite`	🖼 Image	—	—

gpt

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`gpt-4.1-mini`	💬 Text	$0.150	$0.600
`gpt-5.2`	💬 Text	$1.75	$14.00
`gpt-5.3-codex`	💬 Text	$1.75	$14.00
`gpt-5.3-codex-spark`	💬 Text	—	—
`gpt-5.4`	💬 Text	$2.50	$15.00
`gpt-5.4-mini`	💬 Text	$0.700	$4.20
`gpt-5.4-openai-compact`	💬 Text	$2.50	$15.00
`gpt-5.4-xhigh-fast`	💬 Text	$2.50	$15.00
`gpt-5.5`	💬 Text	$5.00	$30.00
`gpt-5.5-openai-compact`	💬 Text	$5.00	$30.00
`gpt-5.5-xhigh-fast`	💬 Text	$5.00	$30.00

volc-deepseek

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`volc-deepseek-v3.2`	💬 Text	$0.280	$0.420
`volc-deepseek-v4-flash`	💬 Text	$0.140	$0.280
`volc-deepseek-v4-pro`	💬 Text	$1.67	$3.33

volc-glm

Alias	Modality	Input / 1M ($)	Output / 1M ($)
`volc-glm-4.7`	💬 Text	$0.420	$1.94

Routing & failover

You always pass the platform alias (e.g. gpt-5.2), never the upstream-specific id (e.g. openai/gpt-5.2). Behind the scenes:

Each alias maps to one or more upstream routes ordered by priority.
Each route consumes credentials from a per-provider pool. If a credential gets repeatedly rejected, it's cooled down for 60s and the next request picks a sibling.
A single retryable upstream error (5xx, 401/403 token issues, 429 rate limit) triggers the relay to retry the next candidate automatically — your client only sees the final result.
If every candidate is exhausted, you get an explicit 503 with a clear message instead of a stuck request.

See Common errors for what each status code means and how to react.