LLM directory · Pricing · Benchmarks
LLM cost calculator and model directory
Per-token pricing, context windows, capabilities, and public benchmark scores for 2,518 models across 54 providers — chat, embeddings, image, audio, video, and rerank. Pricing verified weekly. Submit edits in one click. Route any model via Future AGI's Agent Command Center for unified caching, fallback, and observability.
Browse by provider
Browse by category
One-click into the directory above, scoped to a single modality.
Chat & reasoning
Conversational and instruction-following models.
Embeddings
Vector-encoding models for retrieval, clustering, and semantic search.
Image generation
Text-to-image and diffusion models.
Image edit
Inpainting, editing, and image-to-image.
Speech to text
Audio transcription and diarization.
Text to speech
Voice synthesis and TTS models.
Video generation
Text-to-video and image-to-video models.
Rerank
Cross-encoder rerankers for RAG pipelines.
Moderation
Content classification and policy enforcement.
OCR
Document and image text extraction.
Realtime audio
Streaming voice + tool-use over WebSocket.
Highest-ranked chat models on Chatbot Arena
Sorted by Chatbot Arena ELO (real human pairwise preference). Open the full directory →
| Model | Provider | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|
| | Azure AI Foundry | $5.00/M | $25.00/M | 200,000 |
| | Google Vertex AI | $2.00/M | $12.00/M | 1,048,576 |
| | Azure AI Foundry | $5.00/M | $25.00/M | 200,000 |
| | Google Vertex AI | $2.00/M | $12.00/M | 1,048,576 |
| | OpenAI | $5.00/M | $30.00/M | 1,050,000 |
| | Azure OpenAI | $1.75/M | $14.00/M | 272,000 |
| | OpenAI | $1.75/M | $14.00/M | 128,000 |
| | Azure OpenAI | $2.50/M | $15.00/M | 1,050,000 |
| | xAI | $2.00/M | $6.00/M | 2,000,000 |
| | xAI | $2.00/M | $6.00/M | 2,000,000 |
| | Azure AI Foundry | $5.00/M | $25.00/M | 200,000 |
| | Azure AI Foundry | $3.00/M | $15.00/M | 1,000,000 |
| | Moonshot AI | $0.950/M | $4.00/M | 262,144 |
| | xAI | $3.00/M | $15.00/M | 256,000 |
| | Azure OpenAI | $0.750/M | $4.50/M | 1,050,000 |
FAQ
How many models does the directory cover?
2,518 models across 54+ providers — chat, embeddings, image generation, image edit, speech-to-text, text-to-speech, video generation, rerank, moderation, OCR, and realtime audio. Maxim and most competitors only cover chat models on a handful of providers.
How often is pricing updated?
Pricing is verified weekly from litellm + models.dev + OpenRouter, with the per-model "Last verified" timestamp shown on every page. If you spot an outdated rate, click "suggest edit" on the row — submissions land in our review queue and roll out on the next refresh.
How accurate are these prices?
We aggregate from public, version-controlled sources (BerriAI/litellm, models.dev, OpenRouter) and pull the most-recently-published number. We do NOT contractually guarantee accuracy — for production billing decisions, always confirm against the provider's own pricing page (linked on every model page).
Do you cover deprecated and upcoming models?
Yes — every model carries a `deprecation_date` when one is published by the provider. Models retiring in the next 180 days show a countdown badge; deprecated models still render with a clear notice and migration recommendations to a current alternative.
Can I get pricing for a model that isn't listed?
Click "Request a model" on the hub — submissions land in a public queue, ranked by community votes. Models with active demand get added within a few days. No sign-up required.
How do I route across providers without changing my code?
Use Future AGI's Agent Command Center — an open-source OpenAI-compatible gateway. One config supports cost-optimized routing, latency-aware retries, model fallback, shadow / mirror traffic, exact + semantic caching, and 18 built-in guardrail scanners. Free for the first 100K requests/month. The "Production recipe" block on every model page has a ready-to-paste config.