Straightforward Pricing
Every rate is listed openly. Deposit credit, call models, and pay only for the tokens your requests actually consume. No monthly minimums.
Per-Token Rates per 1M tokens
Request a free $2 trial credit to test any model before committing to a deposit.
| Model | Slug | Context | Input / 1M | Output / 1M | vs Direct |
|---|---|---|---|---|---|
| GPT-5.4 | gpt-5.4 | 200K | $0.875 | $5.250 | ~65% less |
| GPT-5.3 Codex | gpt-5.3-codex | 200K | $0.613 | $4.900 | ~65% less |
| GPT-5.2 | gpt-5.2 | 128K | $0.613 | $4.900 | ~65% less |
| Model | Slug | Context | Input / 1M | Output / 1M | vs Direct |
|---|---|---|---|---|---|
| Gemini 3.1 Pro Preview | gemini-3.1-pro-preview | 1M | $0.700 | $4.200 | ~65% less |
| Gemini 3 Flash Preview | gemini-3-flash-preview | 1M | $0.175 | $1.050 | ~65% less |
| Model | Slug | Context | Input / 1M | Output / 1M | vs Direct |
|---|---|---|---|---|---|
| GLM 5 Turbo | glm-5-turbo | 128K | $0.420 | $1.400 | ~65% less |
| GLM 5 | glm-5 | 128K | $0.350 | $1.120 | ~65% less |
| Model | Slug | Context | Input / 1M | Output / 1M | vs Direct |
|---|---|---|---|---|---|
| V3.2 | deepseek-v3.2 | 64K | $0.280 | $0.840 | ~65% less |
Credit Deposit Options
Fund your account with any amount. Unused credit never expires.
Deposits are non-refundable ยท Balance persists until spent ยท Top up at any time
What You Get
The practical benefits of a unified gateway over managing providers separately.
No Base Fees
Some aggregator services charge a platform fee on top of token costs. LLMAI charges nothing beyond the per-token rate โ every dollar of your deposit goes toward actual API usage.
Single Balance, All Models
Managing credit across OpenAI, Google, and DeepSeek separately means multiple accounts and payment instruments. One LLMAI deposit covers every provider in the catalog.
Full Rate Transparency
Every model's input and output rate is published on this page before you commit a dollar. No locked-in pricing, no surprise charges, no opaque markups.
Start saving on AI API costs
Register, add credit, and make your first model call โ all before your coffee gets cold.
Common Questions
How is my usage charged?
You load credit into your LLMAI account before making calls. Every API request deducts the token cost โ calculated from input and output token counts at the model's published per-million-token rate. There are no recurring charges; your balance is only reduced when you make a request.
What happens when I run out of credit?
Requests will fail with a 402 error until you reload. Your API key remains valid โ there is no deactivation. Add more credit from the console and calls resume immediately.
Can I use my existing OpenAI SDK code?
Yes. Set your base URL to https://api.llmai.dev/v1 and replace your API key with your LLMAI key. The request and response format is identical to OpenAI's Chat Completions API.
Are purchases refundable?
Credit deposits are non-refundable. We recommend starting with a smaller amount to test your integration. If a platform error caused an incorrect deduction, contact support with the request details.
What payment methods do you support?
We accept major credit and debit cards processed through Stripe. Additional payment options may be available โ contact support if you have specific requirements.
How does LLMAI's pricing compare to going direct?
LLMAI customers typically save over 65% compared to going directly to providers. Our aggregated access model means you pay significantly less per token for the same models โ GPT-5, Gemini, DeepSeek, and GLM โ without any subscription fees.