Models

Every AI model available through LLMAI, with API slugs, context sizes, and token pricing.

Model Reference

Use the slug from the tables below as the `model` field in your request. All models are available on every account, with no model-specific unlocking or tier requirements.

OpenAI

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
GPT-5.4	`gpt-5.4`	1M	$0.788	$5.250
GPT-5.5	`gpt-5.5`	1M	$1.575	$10.500

Anthropic Claude

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
Claude Opus 4.8 ✨ NEW	`claude-opus-4.8`	1M	$2.625	$10.500
Claude Opus 4.7	`claude-opus-4.7`	1M	$2.625	$10.500
Claude Opus 4.6	`claude-opus-4.6`	1M	$1.575	$7.875
Claude Sonnet 4.6	`claude-sonnet-4.6`	1M	$1.050	$5.250
Claude Sonnet 5 ✨ NEW	`claude-sonnet-5`	1M	$1.050	$5.250
Claude Fable 5 🧠 REASONING	`claude-fable-5`	1M	$4.500	$22.500
Claude Haiku 4.5	`claude-haiku-4.5`	1M	$0.525	$2.625

Google Gemini

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
Gemini 3.5 Flash	`gemini-3.5-flash`	1M	$0.788	$4.725
Gemini 3.1 Pro (Preview)	`gemini-3.1-pro-preview`	1M	$1.050	$4.200
Gemini 3.1 Flash Lite (Preview)	`gemini-3.1-flash-lite-preview`	1M	$0.137	$0.788
Gemini 3 Flash (Preview)	`gemini-3-flash-preview`	1M	$0.210	$1.260

Image & Video Generation

These models are billed at a flat $0.21 per successful generation rather than per token. Send a standard chat-completion request; the upstream provider returns the generated media in the response.

Model	API Slug	Type	Per Generation
Veo 3.1	`veo-3.1`	Video	$0.21
Nano Banana 2	`nano-banana-2`	Image	$0.21
Nano Banana Pro	`nano-banana-pro`	Image	$0.21
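
The flat-fee rule can be sketched in a few lines of Python. This is only an illustration of the billing arithmetic described above, assuming that "successful" is something your own code records per request; `media_cost` and the `(slug, succeeded)` pairs are hypothetical names, not part of the API.

```python
from decimal import Decimal

FLAT_FEE = Decimal("0.21")  # USD per successful generation, from the table above
MEDIA_SLUGS = {"veo-3.1", "nano-banana-2", "nano-banana-pro"}


def media_cost(outcomes: list[tuple[str, bool]]) -> Decimal:
    """Sum the flat fee over (slug, succeeded) pairs; failed generations cost nothing."""
    total = Decimal("0")
    for slug, succeeded in outcomes:
        if slug not in MEDIA_SLUGS:
            raise KeyError(f"not a flat-fee model: {slug}")
        if succeeded:
            total += FLAT_FEE
    return total
```

Two successful generations and one failure would come to $0.42 under this reading of the rule.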

Kimi (Moonshot AI)

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
Kimi K2.7 Code	`kimi-k2.7-code`	256K	$0.665	$2.800
Kimi K2.6	`kimi-k2.6`	256K	$0.620	$2.600
Kimi K2.5	`kimi-k2.5`	256K	$0.210	$1.050

DeepSeek

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
DeepSeek V4 Pro	`deepseek-v4-pro`	128K	$0.2175	$0.435
DeepSeek V4 Flash	`deepseek-v4-flash`	128K	$0.050	$0.100
DeepSeek V3.2	`deepseek-v3.2`	64K	$0.088	$0.132

MiniMax

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
MiniMax M3	`minimax-m3`	1M	$0.420	$1.680
MiniMax M2.7	`minimax-m2.7`	1M	$0.200	$0.850

Alibaba Qwen

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
Qwen 3.6 Plus	`qwen3.6-plus`	128K	$0.120	$0.680
Qwen 3.5 Plus	`qwen3.5-plus`	128K	$0.140	$0.810

Xiaomi MiMo

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
MiMo V2.5 Pro	`mimo-v2.5-pro`	128K	$0.3045	$0.609
MiMo V2.5	`mimo-v2.5`	128K	$0.098	$0.196

Google Gemma

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
Gemma 4	`gemma-4`	128K	$0.046	$0.130

Z.AI GLM

Model	API Slug	Context Window	Input / 1M tokens	Output / 1M tokens
GLM 5.1	`glm-5.1`	128K	$0.550	$1.300

Making a Request

Set the `model` field to the exact slug shown in the tables above:

curl https://api.llmai.dev/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3.2",
    "messages": [{"role": "user", "content": "Review this Python snippet for bugs."}]
  }'

Slugs are case-sensitive. `Kimi-K2.6` and `kimi-k2.6` are not equivalent; use the lowercase slug exactly as shown.
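
The same request can be built from Python with only the standard library. This is a minimal sketch, assuming the endpoint and JSON body from the curl example; `build_chat_request` and `send` are hypothetical helper names, and the lowercase check is a local convenience rather than anything the API is documented to provide.

```python
import json
import urllib.request

API_URL = "https://api.llmai.dev/v1/chat/completions"  # endpoint from the curl example


def build_chat_request(api_key: str, slug: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat-completion request for the given slug."""
    # Slugs are case-sensitive and every documented slug is lowercase,
    # so reject anything else before spending a network round trip.
    if slug != slug.lower():
        raise ValueError(f"slug must be lowercase, got {slug!r}")
    body = {"model": slug, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `send(build_chat_request("YOUR_API_KEY", "deepseek-v3.2", "Review this Python snippet for bugs."))` would issue the request shown above; the shape of the returned JSON is not specified on this page.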

Fetching the Model List Programmatically

curl https://api.llmai.dev/v1/models \
  -H "Authorization: Bearer YOUR_API_KEY"

The endpoint returns a standard `ListModelsResponse` object containing every model ID currently available on the platform.
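
A Python sketch of the same call follows. It assumes the response uses the common list shape `{"object": "list", "data": [{"id": ...}, ...]}`; this page does not spell out the fields, so treat the `data` and `id` keys as an assumption and adjust them to the actual payload. The helper names are hypothetical.

```python
import json
import urllib.request

MODELS_URL = "https://api.llmai.dev/v1/models"  # endpoint from the curl example


def extract_model_ids(payload: dict) -> list[str]:
    """Return the sorted model IDs from a list response.

    Assumes each entry of payload["data"] carries an "id" key.
    """
    return sorted(item["id"] for item in payload.get("data", []))


def fetch_model_ids(api_key: str) -> list[str]:
    """Call the models endpoint and return the available slugs."""
    request = urllib.request.Request(
        MODELS_URL, headers={"Authorization": f"Bearer {api_key}"}
    )
    with urllib.request.urlopen(request) as response:
        return extract_model_ids(json.load(response))


# Offline illustration with a hand-written payload (not real API output).
sample = {"object": "list", "data": [{"id": "kimi-k2.6"}, {"id": "deepseek-v3.2"}]}
print(extract_model_ids(sample))  # ['deepseek-v3.2', 'kimi-k2.6']
```

Comparing the result against the slug your application is configured with is one way to catch a retired model before sending traffic to it.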

Notes on Pricing

Token-priced models charge per one million tokens — input and output independently.
Image/video models charge a flat fee per successful generation (no token billing applies).
Charges scale proportionally with usage: 2,000 input tokens cost (2,000 ÷ 1,000,000) × the input rate, and output tokens are charged the same way at the output rate.
Context window is the combined maximum for prompt and completion tokens in a single call.
Model availability is subject to change; check the Models catalog for the current live list.
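
The pricing and context-window notes above can be worked through in Python. This is a sketch under stated assumptions: the rates are copied from the tables on this page for two models only, "64K" and "1M" are read as 64,000 and 1,000,000 tokens, and `token_cost` and `fits_context` are hypothetical helper names. `Decimal` is used to avoid floating-point rounding in the dollar amounts.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# USD per 1M tokens as (input, output), copied from the tables on this page.
RATES = {
    "deepseek-v3.2": (Decimal("0.088"), Decimal("0.132")),
    "claude-haiku-4.5": (Decimal("0.525"), Decimal("2.625")),
}

# Context windows in tokens; reading "64K" as 64,000 and "1M" as 1,000,000 is an assumption.
CONTEXT = {"deepseek-v3.2": 64_000, "claude-haiku-4.5": 1_000_000}


def token_cost(slug: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Charge input and output independently: (tokens / 1,000,000) * rate for each."""
    in_rate, out_rate = RATES[slug]
    input_charge = Decimal(input_tokens) / MILLION * in_rate
    output_charge = Decimal(output_tokens) / MILLION * out_rate
    return input_charge + output_charge


def fits_context(slug: str, prompt_tokens: int, max_completion_tokens: int) -> bool:
    """The window is the combined limit for prompt and completion tokens in one call."""
    return prompt_tokens + max_completion_tokens <= CONTEXT[slug]
```

For example, a `deepseek-v3.2` call with 2,000 input tokens and 500 output tokens comes to $0.000176 + $0.000066 = $0.000242, and a 60,000-token prompt with a 5,000-token completion budget would not fit its 64K window.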
