// all accessible via one API key

Model Catalog

Browse every model available through LLMAI. Filter by provider, compare context windows and capabilities, and copy the slug directly into your code.
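The filtering described above can be sketched offline. This is a minimal illustration, not LLMAI's actual catalog API: the `Model` record, the `CATALOG` list, and the `find_models` helper are hypothetical names, and the entries are a hand-copied subset of the cards on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    """Hypothetical record mirroring one catalog card."""
    provider: str
    slug: str
    context_k: int        # context window in thousands of tokens, as listed
    tags: frozenset       # capability tags shown on the card


# Hand-copied subset of the catalog (assumption: not fetched from any API).
CATALOG = [
    Model("OpenAI", "gpt-5.4", 200, frozenset({"Chat", "Reasoning", "Vision"})),
    Model("OpenAI", "gpt-5.2", 128, frozenset({"Chat", "Reasoning"})),
    Model("Z.AI", "glm-5-turbo", 128, frozenset({"Chat", "Fast", "Code"})),
    Model("Z.AI", "glm-5.1", 128, frozenset({"Chat", "Reasoning", "Vision"})),
]


def find_models(catalog, provider=None, min_context_k=0, required_tags=()):
    """Return slugs matching the provider, context, and capability filters."""
    required = set(required_tags)
    return [
        m.slug
        for m in catalog
        if (provider is None or m.provider == provider)
        and m.context_k >= min_context_k
        and required <= m.tags   # every required tag must be present
    ]


vision_slugs = find_models(CATALOG, required_tags=["Vision"])
```

The returned strings are the slugs exactly as listed on each card, so they can be pasted into whatever request field your client uses to select a model.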

OpenAI
GPT 5.4
gpt-5.4
200K ctx

OpenAI's flagship model. Excels at complex reasoning, nuanced instruction-following, and multimodal tasks.

Chat, Reasoning, Vision
Input / 1M tokens
$0.875
Output / 1M tokens
$5.250
GPT 5.3 Codex
gpt-5.3-codex
200K ctx

A code-specialist variant of the GPT-5 series, tuned for generation, review, and debugging workflows.

Chat, Code, Reasoning
Input / 1M tokens
$0.613
Output / 1M tokens
$4.900
GPT 5.2
gpt-5.2
128K ctx

A capable general-purpose model suited for chat, summarisation, and structured reasoning.

Chat, Reasoning
Input / 1M tokens
$0.613
Output / 1M tokens
$4.900
Google
Gemini 3.1 Pro
gemini-3.1-pro
>200K ctx

Google's most advanced model with an extended context window. Ideal for large documents, long conversations, and deep multimodal tasks.

Chat, Vision, Long Context
Input / 1M tokens
$0.700
Output / 1M tokens
$4.200
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview
<200K ctx

Preview variant of Gemini 3.1 Pro optimised for standard context workloads. Strong multimodal reasoning at reduced cost.

Chat, Vision, Reasoning
Input / 1M tokens
$0.525
Output / 1M tokens
$3.150
Z.AI
GLM 5 Turbo
glm-5-turbo
128K ctx

The speed-optimised variant of GLM 5. Excellent for latency-sensitive pipelines and high-frequency calls.

Chat, Fast, Code
Input / 1M tokens
$0.420
Output / 1M tokens
$1.400
GLM 5
glm-5
128K ctx

Z.AI's full-capability model. Strong multilingual performance and solid reasoning across technical domains.

Chat, Reasoning, Code
Input / 1M tokens
$0.350
Output / 1M tokens
$1.120
GLM 5.1
glm-5.1
128K ctx

The latest iteration of Z.AI's GLM family, featuring improved instruction-following and enhanced vision understanding.

Chat, Reasoning, Vision
Input / 1M tokens
$0.490
Output / 1M tokens
$1.470
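
Because prices are quoted per 1M tokens, the cost of a single request is a simple proportion. The sketch below shows that arithmetic for a few of the listed models. The `PRICES_PER_MILLION` table and the `estimate_cost` helper are hypothetical names, the prices are copied by hand from the cards above, and the calculation assumes billing is strictly linear in token count (no minimums, caching discounts, or tiered rates).

```python
from decimal import Decimal

# (input price, output price) in USD per 1M tokens, copied from the catalog.
PRICES_PER_MILLION = {
    "gpt-5.4": (Decimal("0.875"), Decimal("5.250")),
    "gemini-3.1-pro": (Decimal("0.700"), Decimal("4.200")),
    "glm-5": (Decimal("0.350"), Decimal("1.120")),
}


def estimate_cost(slug, input_tokens, output_tokens):
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    input_price, output_price = PRICES_PER_MILLION[slug]
    million = Decimal(1_000_000)
    # Scale each per-1M price by the number of tokens actually used.
    return (input_price * input_tokens + output_price * output_tokens) / million


cost = estimate_cost("glm-5", input_tokens=2_000, output_tokens=500)
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens on glm-5 comes to $0.00126. `Decimal` is used instead of floats so that small per-request amounts do not pick up rounding noise when summed.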