Databricks models & pricing
Databricks hosts 28 models (28 with public pricing) covering 2 modalities. Mosaic AI Model Serving — Llama, DBRX, Claude on Databricks with Unity Catalog governance. Cheapest input starts at $0.0500/M tokens; the most premium goes up to $15.00/M. Use Future AGI's Agent Command Center to route any Databricks model with cost-optimized fallback and unified observability.
chat 26
| Model ↕ | Input / 1M ↕ | Output / 1M ↕ | Context ↕ | Caps |
|---|---|---|---|---|
| Databricks GPT 5 nano | $0.0500/M | $0.400/M | 272,000 | |
| Databricks GPT Oss 20B | $0.0700/M | $0.300/M | 131,072 | |
| Databricks Gemma 3.12B | $0.150/M | $0.500/M | 128,000 | |
| Databricks GPT Oss 120B | $0.150/M | $0.600/M | 131,072 | |
| Databricks Meta Llama 3.1 8B Instruct | $0.150/M | $0.450/M | 200,000 | |
| Databricks GPT 5 mini | $0.250/M | $2.00/M | 272,000 | |
| Databricks Gemini 2.5 Flash | $0.300/M | $2.50/M | 1,048,576 | tools |
| Databricks Llama 2.70B Chat | $0.500/M | $1.50/M | 4,096 | |
| Databricks Llama 4 Maverick | $0.500/M | $1.50/M | 128,000 | |
| Databricks Meta Llama 3.3 70B Instruct | $0.500/M | $1.50/M | 128,000 | |
| Databricks Mixtral 8×7B Instruct | $0.500/M | $1.00/M | 4,096 | |
| Databricks Mpt 7B Instruct | $0.500/M | — | 8,192 | |
| Databricks Claude Haiku 4.5 | $1.00/M | $5.00/M | 200,000 | tools · reasoning |
| Databricks Meta Llama 3.70B Instruct | $1.00/M | $3.00/M | 128,000 | |
| Databricks Mpt 30B Instruct | $1.00/M | $1.00/M | 8,192 | |
| Databricks Gemini 2.5 Pro | $1.25/M | $10.00/M | 1,048,576 | tools |
| Databricks GPT 5 | $1.25/M | $10.00/M | 272,000 | |
| Databricks GPT 5.1 | $1.25/M | $10.00/M | 272,000 | |
| Databricks Claude 3.7 Sonnet | $3.00/M | $15.00/M | 200,000 | tools · reasoning |
| Databricks Claude Sonnet 4 | $3.00/M | $15.00/M | 200,000 | tools · reasoning |
| Databricks Claude Sonnet 4.1 | $3.00/M | $15.00/M | 200,000 | tools · reasoning |
| Databricks Claude Sonnet 4.5 | $3.00/M | $15.00/M | 200,000 | tools · reasoning |
| Databricks Claude Opus 4.5 | $5.00/M | $25.00/M | 200,000 | tools · reasoning |
| Databricks Meta Llama 3.1 405B Instruct | $5.00/M | $15.00/M | 128,000 | |
| Databricks Claude Opus 4 | $15.00/M | $75.00/M | 200,000 | tools · reasoning |
| Databricks Claude Opus 4.1 | $15.00/M | $75.00/M | 200,000 | tools · reasoning |
embedding 2
| Model ↕ | Input / 1M ↕ | Output / 1M ↕ | Context ↕ | Caps |
|---|---|---|---|---|
| Databricks Bge Large En | $0.100/M | — | 512 | |
| Databricks Gte Large En | $0.130/M | — | 8,192 |
FAQ
How many Databricks models are there?
28 Databricks models are listed across 2 modalities on this page. 28 have public per-token pricing.
How is Databricks pricing verified?
Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.
Which Databricks model is cheapest?
Input pricing on Databricks starts at $0.0500 per 1M tokens. Sort the table by price (or use the in-page filter at the top) to find the cheapest model that matches your capability requirements.
Can I route to Databricks via an OpenAI-compatible API?
Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a Databricks target, and call Databricks models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.