Fireworks AI models & pricing

Fireworks AI hosts 268 models (259 with public pricing) covering 5 modalities. Fast inference for Llama, Qwen, DeepSeek, Mixtral and 200+ open-source models. Cheapest input starts at $0.000130/M tokens; the most premium goes up to $3.00/M. Use Future AGI's Agent Command Center to route any Fireworks AI model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗
268

chat 244

Model Input / 1M Output / 1M Context Caps
Accounts Fireworks Models Flux 1 Dev Controlnet Union $0.001000/M $0.001000/M 4,096
Accounts Fireworks Models GPT Oss 20B $0.0500/M $0.200/M 131,072 tools · reasoning
Accounts Fireworks Models Codegemma 2B $0.1000/M $0.1000/M 8,192
Accounts Fireworks Models Cogito V1 preview Llama 3B $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models DeepSeek Coder 1B Base $0.1000/M $0.1000/M 16,384
Accounts Fireworks Models DeepSeek R1 Distill Qwen 1p5b $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Ernie 4p5.21b A3b Pt $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Ernie 4p5.300b A47b Pt $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Flux 1 Dev $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Flux 1 Schnell $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Gemma 2B It $0.1000/M $0.1000/M 8,192
Accounts Fireworks Models Llama Guard 3.1B $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Llama V2.70b $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Llama V3p1.405b Instruct Long $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Llama V3p1.70b Instruct 1B $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Llama V3p1.8b Instruct $0.1000/M $0.1000/M 16,384
Accounts Fireworks Models Llama V3p2.1b $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Llama V3p2.1b Instruct $0.1000/M $0.1000/M 16,384
Accounts Fireworks Models Llama V3p2.3b $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Llama V3p2.3b Instruct $0.1000/M $0.1000/M 16,384
Accounts Fireworks Models Minimax M1.80k $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Ministral 3.3B Instruct 2512 $0.1000/M $0.1000/M 256,000
Accounts Fireworks Models Nemotron nano V2.12b VL $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Phi 2.3B $0.1000/M $0.1000/M 2,048
Accounts Fireworks Models Phi 3 mini 128k Instruct $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Qwen2 VL 2B Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5.0p5b Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5.1p5b Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 0p5b $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 0p5b Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 1p5b $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 1p5b Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 3B $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 3B Instruct $0.1000/M $0.1000/M 32,768
Accounts Fireworks Models Qwen3.0p6b $0.1000/M $0.1000/M 40,960
Accounts Fireworks Models Qwen3.1p7b $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Qwen3.1p7b Fp8 Draft $0.1000/M $0.1000/M 262,144
Accounts Fireworks Models Qwen3.1p7b Fp8 Draft 131072 $0.1000/M $0.1000/M 131,072
Accounts Fireworks Models Qwen3.1p7b Fp8 Draft 40960 $0.1000/M $0.1000/M 40,960
Accounts Fireworks Models Stablecode 3B $0.1000/M $0.1000/M 4,096
Accounts Fireworks Models Starcoder2.3b $0.1000/M $0.1000/M 16,384
Accounts Fireworks Models GPT Oss 120B $0.150/M $0.600/M 131,072 tools · reasoning
Accounts Fireworks Models Llama4 Scout Instruct Basic $0.150/M $0.600/M 131,072
Accounts Fireworks Models Qwen3.30b A3b $0.150/M $0.600/M 131,072
Accounts Fireworks Models Qwen3 Coder 30B A3b Instruct $0.150/M $0.600/M 262,144
Accounts Fireworks Models Qwen3 VL 30B A3b Instruct $0.150/M $0.600/M 262,144
Accounts Fireworks Models Qwen3 VL 30B A3b Thinking $0.150/M $0.600/M 262,144
Accounts Fireworks Models Chronos Hermes 13B v2 $0.200/M $0.200/M 4,096
Accounts Fireworks Models Code Llama 13B $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Llama 13B Instruct $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Llama 13B Python $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Llama 7B $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Llama 7B Instruct $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Llama 7B Python $0.200/M $0.200/M 16,384
Accounts Fireworks Models Code Qwen 1p5.7b $0.200/M $0.200/M 65,536
Accounts Fireworks Models Codegemma 7B $0.200/M $0.200/M 8,192
Accounts Fireworks Models Cogito V1 preview Llama 8B $0.200/M $0.200/M 131,072
Accounts Fireworks Models Cogito V1 preview Qwen 14B $0.200/M $0.200/M 131,072
Accounts Fireworks Models DeepSeek Coder 7B Base $0.200/M $0.200/M 4,096
Accounts Fireworks Models DeepSeek Coder 7B Base V1p5 $0.200/M $0.200/M 4,096
Accounts Fireworks Models DeepSeek Coder 7B Instruct V1p5 $0.200/M $0.200/M 4,096
Accounts Fireworks Models DeepSeek R1.0528 Distill Qwen3.8b $0.200/M $0.200/M 131,072
Accounts Fireworks Models DeepSeek R1 Distill Llama 8B $0.200/M $0.200/M 131,072
Accounts Fireworks Models DeepSeek R1 Distill Qwen 14B $0.200/M $0.200/M 131,072
Accounts Fireworks Models DeepSeek R1 Distill Qwen 7B $0.200/M $0.200/M 131,072
Accounts Fireworks Models Dobby mini Unhinged Plus Llama 3.1 8B $0.200/M $0.200/M 131,072
Accounts Fireworks Models Firellava 13B $0.200/M $0.200/M 4,096
Accounts Fireworks Models Firesearch OCR v6 $0.200/M $0.200/M 8,192
Accounts Fireworks Models Gemma 7B $0.200/M $0.200/M 8,192
Accounts Fireworks Models Gemma 7B It $0.200/M $0.200/M 8,192
Accounts Fireworks Models Gemma2.9b It $0.200/M $0.200/M 8,192
Accounts Fireworks Models Hermes 2 Pro Mistral 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Internvl3.8b $0.200/M $0.200/M 16,384
Accounts Fireworks Models Llama Guard 2.8B $0.200/M $0.200/M 8,192
Accounts Fireworks Models Llama Guard 3.8B $0.200/M $0.200/M 131,072
Accounts Fireworks Models Llama V2.13b $0.200/M $0.200/M 4,096
Accounts Fireworks Models Llama V2.13b Chat $0.200/M $0.200/M 4,096
Accounts Fireworks Models Llama V2.7b $0.200/M $0.200/M 4,096
Accounts Fireworks Models Llama V2.7b Chat $0.200/M $0.200/M 4,096
Accounts Fireworks Models Llama V3.8b $0.200/M $0.200/M 8,192
Accounts Fireworks Models Llama V3.8b Instruct Hf $0.200/M $0.200/M 8,192
Accounts Fireworks Models Llama V3p2.11b Vision Instruct $0.200/M $0.200/M 16,384 vision
Accounts Fireworks Models Llamaguard 7B $0.200/M $0.200/M 4,096
Accounts Fireworks Models Ministral 3.14B Instruct 2512 $0.200/M $0.200/M 256,000
Accounts Fireworks Models Ministral 3.8B Instruct 2512 $0.200/M $0.200/M 256,000
Accounts Fireworks Models Mistral 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Mistral 7B Instruct 4k $0.200/M $0.200/M 32,768
Accounts Fireworks Models Mistral 7B Instruct V0p2 $0.200/M $0.200/M 32,768
Accounts Fireworks Models Mistral 7B Instruct v3 $0.200/M $0.200/M 32,768
Accounts Fireworks Models Mistral 7B V0p2 $0.200/M $0.200/M 32,768
Accounts Fireworks Models Mistral Nemo Base 2407 $0.200/M $0.200/M 128,000
Accounts Fireworks Models Mistral Nemo Instruct 2407 $0.200/M $0.200/M 128,000
Accounts Fireworks Models Mythomax L2.13b $0.200/M $0.200/M 4,096
Accounts Fireworks Models Nous Capybara 7B V1p9 $0.200/M $0.200/M 32,768
Accounts Fireworks Models Nous Hermes Llama2.13b $0.200/M $0.200/M 4,096
Accounts Fireworks Models Nous Hermes Llama2.7b $0.200/M $0.200/M 4,096
Accounts Fireworks Models Nvidia Nemotron nano 12B v2 $0.200/M $0.200/M 131,072
Accounts Fireworks Models Nvidia Nemotron nano 9B v2 $0.200/M $0.200/M 131,072
Accounts Fireworks Models Openchat 3p5.0106.7b $0.200/M $0.200/M 8,192
Accounts Fireworks Models Openhermes 2 Mistral 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Openhermes 2p5 Mistral 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Openorca 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Phi 3 Vision 128k Instruct $0.200/M $0.200/M 32,064
Accounts Fireworks Models Pythia 12B $0.200/M $0.200/M 2,048
Accounts Fireworks Models Qwen V2p5.14b Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen V2p5.7b $0.200/M $0.200/M 131,072
Accounts Fireworks Models Qwen2.7b Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2 VL 7B Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5.14b $0.200/M $0.200/M 131,072
Accounts Fireworks Models Qwen2p5.7b Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 14B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 14B Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 7B Instruct $0.200/M $0.200/M 32,768
Accounts Fireworks Models Qwen2p5 VL 3B Instruct $0.200/M $0.200/M 128,000
Accounts Fireworks Models Qwen2p5 VL 7B Instruct $0.200/M $0.200/M 128,000
Accounts Fireworks Models Qwen3.14b $0.200/M $0.200/M 40,960
Accounts Fireworks Models Qwen3.4b $0.200/M $0.200/M 40,960
Accounts Fireworks Models Qwen3.4b Instruct 2507 $0.200/M $0.200/M 262,144
Accounts Fireworks Models Qwen3.8b $0.200/M $0.200/M 40,960 reasoning
Accounts Fireworks Models Qwen3 VL 8B Instruct $0.200/M $0.200/M 4,096
Accounts Fireworks Models Rolm OCR $0.200/M $0.200/M 128,000
Accounts Fireworks Models Snorkel Mistral 7B Pairrm DPO $0.200/M $0.200/M 32,768
Accounts Fireworks Models Starcoder 16B $0.200/M $0.200/M 8,192
Accounts Fireworks Models Starcoder 7B $0.200/M $0.200/M 8,192
Accounts Fireworks Models Starcoder2.15b $0.200/M $0.200/M 16,384
Accounts Fireworks Models Starcoder2.7b $0.200/M $0.200/M 16,384
Accounts Fireworks Models Toppy M 7B $0.200/M $0.200/M 32,768
Accounts Fireworks Models Yi 6B $0.200/M $0.200/M 4,096
Accounts Fireworks Models Zephyr 7B beta $0.200/M $0.200/M 32,768
Accounts Fireworks Models Glm 4p5 Air $0.220/M $0.880/M 128,000 tools · reasoning
Accounts Fireworks Models Llama4 Maverick Instruct Basic $0.220/M $0.880/M 131,072
Accounts Fireworks Models Qwen3.235b A22b $0.220/M $0.880/M 131,072
Accounts Fireworks Models Qwen3.235b A22b Instruct 2507 $0.220/M $0.880/M 262,144
Accounts Fireworks Models Qwen3.235b A22b Thinking 2507 $0.220/M $0.880/M 262,144
Accounts Fireworks Models Qwen3 VL 235B A22b Instruct $0.220/M $0.880/M 262,144
Accounts Fireworks Models Qwen3 VL 235B A22b Thinking $0.220/M $0.880/M 262,144
Accounts Fireworks Models Minimax M2 $0.300/M $1.20/M 4,096
Accounts Fireworks Models Minimax M2p1 $0.300/M $1.20/M 204,800 tools
Minimax M2p1 $0.300/M $1.20/M 204,800 tools
Accounts Fireworks Models Qwen3 Coder 480B A35b Instruct $0.450/M $1.80/M 262,144 reasoning
Accounts Fireworks Models DeepSeek Coder V2 Lite Base $0.500/M $0.500/M 163,840
Accounts Fireworks Models DeepSeek Coder V2 Lite Instruct $0.500/M $0.500/M 163,840
Accounts Fireworks Models DeepSeek V2 Lite Chat $0.500/M $0.500/M 163,840
Accounts Fireworks Models Dolphin 2p6 Mixtral 8×7B $0.500/M $0.500/M 32,768
Accounts Fireworks Models Firefunction v1 $0.500/M $0.500/M 32,768
Accounts Fireworks Models GPT Oss Safeguard 20B $0.500/M $0.500/M 131,072
Accounts Fireworks Models Mixtral 8×7B $0.500/M $0.500/M 32,768
Accounts Fireworks Models Mixtral 8×7B Instruct $0.500/M $0.500/M 32,768
Accounts Fireworks Models Mixtral 8×7B Instruct Hf $0.500/M $0.500/M 32,768
Accounts Fireworks Models Nous Hermes 2 Mixtral 8×7B DPO $0.500/M $0.500/M 32,768
Accounts Fireworks Models Qwen3.30b A3b Instruct 2507 $0.500/M $0.500/M 262,144
Accounts Fireworks Models DeepSeek R1 Basic $0.550/M $2.19/M 128,000
Accounts Fireworks Models Glm 4p5 $0.550/M $2.19/M 128,000 tools · reasoning
Accounts Fireworks Models Glm 4p6 $0.550/M $2.19/M 202,800 tools · reasoning
Accounts Fireworks Models DeepSeek V3p1 $0.560/M $1.68/M 128,000 reasoning
Accounts Fireworks Models DeepSeek V3p1 Terminus $0.560/M $1.68/M 128,000 reasoning
Accounts Fireworks Models DeepSeek V3p2 $0.560/M $1.68/M 163,840 tools · reasoning
Accounts Fireworks Models Glm 4p7 $0.600/M $2.20/M 202,800 tools · reasoning
Accounts Fireworks Models Kimi K2 Instruct $0.600/M $2.50/M 131,072 tools
Accounts Fireworks Models Kimi K2 Instruct 0905 $0.600/M $2.50/M 262,144 tools
Accounts Fireworks Models Kimi K2 Thinking $0.600/M $2.50/M 262,144 tools
Accounts Fireworks Models Kimi K2p5 $0.600/M $3.00/M 262,144 tools
Glm 4p7 $0.600/M $2.20/M 202,800 tools · reasoning
Kimi K2p5 $0.600/M $3.00/M 262,144 tools
Accounts Fireworks Models Code Llama 34B $0.900/M $0.900/M 16,384
Accounts Fireworks Models Code Llama 34B Instruct $0.900/M $0.900/M 16,384
Accounts Fireworks Models Code Llama 34B Python $0.900/M $0.900/M 16,384
Accounts Fireworks Models Code Llama 70B $0.900/M $0.900/M 4,096
Accounts Fireworks Models Code Llama 70B Instruct $0.900/M $0.900/M 4,096
Accounts Fireworks Models Code Llama 70B Python $0.900/M $0.900/M 4,096
Accounts Fireworks Models Cogito V1 preview Llama 70B $0.900/M $0.900/M 131,072
Accounts Fireworks Models Cogito V1 preview Qwen 32B $0.900/M $0.900/M 131,072
Accounts Fireworks Models DeepSeek Coder 33B Instruct $0.900/M $0.900/M 16,384
Accounts Fireworks Models DeepSeek R1 Distill Llama 70B $0.900/M $0.900/M 131,072
Accounts Fireworks Models DeepSeek R1 Distill Qwen 32B $0.900/M $0.900/M 131,072
Accounts Fireworks Models DeepSeek v3 $0.900/M $0.900/M 128,000
Accounts Fireworks Models DeepSeek v3.0324 $0.900/M $0.900/M 163,840
Accounts Fireworks Models Devstral Small 2505 $0.900/M $0.900/M 131,072
Accounts Fireworks Models Dobby Unhinged Llama 3.3 70B New $0.900/M $0.900/M 131,072
Accounts Fireworks Models Dolphin 2.9 2 Qwen2.72b $0.900/M $0.900/M 131,072
Accounts Fireworks Models Fare 20B $0.900/M $0.900/M 131,072
Accounts Fireworks Models Firefunction v2 $0.900/M $0.900/M 8,192 tools
Accounts Fireworks Models Gemma 3.27B It $0.900/M $0.900/M 131,072
Accounts Fireworks Models Internvl3.38b $0.900/M $0.900/M 16,384
Accounts Fireworks Models Internvl3.78b $0.900/M $0.900/M 16,384
Accounts Fireworks Models Kat Coder $0.900/M $0.900/M 262,144
Accounts Fireworks Models Kat Dev 32B $0.900/M $0.900/M 131,072
Accounts Fireworks Models Kat Dev 72B exp $0.900/M $0.900/M 131,072
Accounts Fireworks Models Llama V2.70b Chat $0.900/M $0.900/M 2,048
Accounts Fireworks Models Llama V3.70b Instruct $0.900/M $0.900/M 8,192
Accounts Fireworks Models Llama V3.70b Instruct Hf $0.900/M $0.900/M 8,192
Accounts Fireworks Models Llama V3p1.70b Instruct $0.900/M $0.900/M 131,072
Accounts Fireworks Models Llama V3p1 Nemotron 70B Instruct $0.900/M $0.900/M 131,072
Accounts Fireworks Models Llama V3p2.90b Vision Instruct $0.900/M $0.900/M 16,384 vision
Accounts Fireworks Models Llama V3p3.70b Instruct $0.900/M $0.900/M 131,072
Accounts Fireworks Models Llava Yi 34B $0.900/M $0.900/M 4,096
Accounts Fireworks Models Mistral Small 24B Instruct 2501 $0.900/M $0.900/M 32,768
Accounts Fireworks Models Nous Hermes 2 Yi 34B $0.900/M $0.900/M 4,096
Accounts Fireworks Models Nous Hermes Llama2.70b $0.900/M $0.900/M 4,096
Accounts Fireworks Models Phind Code Llama 34B Python v1 $0.900/M $0.900/M 16,384
Accounts Fireworks Models Phind Code Llama 34B v1 $0.900/M $0.900/M 16,384
Accounts Fireworks Models Phind Code Llama 34B v2 $0.900/M $0.900/M 16,384
Accounts Fireworks Models Qwen Qwq 32B preview $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen1p5.72b Chat $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2.72b Instruct $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2 VL 72B Instruct $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2p5.32b $0.900/M $0.900/M 131,072
Accounts Fireworks Models Qwen2p5.32b Instruct $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2p5.72b $0.900/M $0.900/M 131,072
Accounts Fireworks Models Qwen2p5.72b Instruct $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 32B $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 32B Instruct $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwen2p5 Coder 32B Instruct 128k $0.900/M $0.900/M 131,072
Accounts Fireworks Models Qwen2p5 Coder 32B Instruct 32k Rope $0.900/M $0.900/M 32,768
Accounts Fireworks Models Qwen2p5 Coder 32B Instruct 64k $0.900/M $0.900/M 65,536
Accounts Fireworks Models Qwen2p5 Math 72B Instruct $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwen2p5 VL 32B Instruct $0.900/M $0.900/M 128,000
Accounts Fireworks Models Qwen2p5 VL 72B Instruct $0.900/M $0.900/M 128,000
Accounts Fireworks Models Qwen3.30b A3b Thinking 2507 $0.900/M $0.900/M 262,144
Accounts Fireworks Models Qwen3.32b $0.900/M $0.900/M 131,072 reasoning
Accounts Fireworks Models Qwen3 Coder 480B Instruct Bf16 $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwen3 Next 80B A3b Instruct $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwen3 Next 80B A3b Thinking $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwen3 VL 32B Instruct $0.900/M $0.900/M 4,096
Accounts Fireworks Models Qwq 32B $0.900/M $0.900/M 131,072
Accounts Fireworks Models Yi 34B $0.900/M $0.900/M 4,096
Accounts Fireworks Models Yi 34B 200k Capybara $0.900/M $0.900/M 200,000
Accounts Fireworks Models Yi 34B Chat $0.900/M $0.900/M 4,096
Accounts Fireworks Models Cogito 671B V2 P1 $1.20/M $1.20/M 163,840
Accounts Fireworks Models Dbrx Instruct $1.20/M $1.20/M 32,768
Accounts Fireworks Models DeepSeek Coder V2 Instruct $1.20/M $1.20/M 65,536
Accounts Fireworks Models DeepSeek Prover v2 $1.20/M $1.20/M 163,840
Accounts Fireworks Models DeepSeek V2p5 $1.20/M $1.20/M 32,768
Accounts Fireworks Models Glm 4p5v $1.20/M $1.20/M 131,072 reasoning
Accounts Fireworks Models GPT Oss Safeguard 120B $1.20/M $1.20/M 131,072
Accounts Fireworks Models Mistral Large 3 Fp8 $1.20/M $1.20/M 256,000
Accounts Fireworks Models Mixtral 8×22B $1.20/M $1.20/M 65,536
Accounts Fireworks Models Mixtral 8×22B Instruct $1.20/M $1.20/M 65,536
Accounts Fireworks Models Mixtral 8×22B Instruct Hf $1.20/M $1.20/M 65,536 tools
Accounts Fireworks Models DeepSeek R1 $3.00/M $8.00/M 128,000
Accounts Fireworks Models DeepSeek R1.0528 $3.00/M $8.00/M 160,000
Accounts Fireworks Models Llama V3p1.405b Instruct $3.00/M $3.00/M 128,000 tools
Accounts Fireworks Models Yi Large $3.00/M $3.00/M 32,768

image generation 9

Model Input / 1M Output / 1M Context Caps
Accounts Fireworks Models Japanese Stable Diffusion Xl $0.000130/M $0.000130/M 4,096
Accounts Fireworks Models Playground V2.1024px Aesthetic $0.000130/M $0.000130/M 4,096
Accounts Fireworks Models Playground V2.5 1024px Aesthetic $0.000130/M $0.000130/M 4,096
Accounts Fireworks Models Ssd 1B $0.000130/M $0.000130/M 4,096
Accounts Fireworks Models Stable Diffusion Xl 1024 v1.0 $0.000130/M $0.000130/M 4,096
Accounts Fireworks Models Flux 1 Schnell Fp8 $0.000350/M $0.000350/M 4,096
Accounts Fireworks Models Flux 1 Dev Fp8 $0.000500/M $0.000500/M 4,096
Accounts Fireworks Models Flux Kontext Pro $0.0400/M $0.0400/M 4,096
Accounts Fireworks Models Flux Kontext Max $0.0800/M $0.0800/M 4,096

embedding 8

Model Input / 1M Output / 1M Context Caps
Nomic AI Nomic Embed Text v1 $0.008000/M 8,192
Nomic AI Nomic Embed Text v1.5 $0.008000/M 8,192
Thenlper Gte Base $0.008000/M 512
Thenlper Gte Large $0.0160/M 512
Whereisai Uae Large v1 $0.0160/M 512
Accounts Fireworks Models $0.1000/M 40,960
Accounts Fireworks Models Qwen3 Embedding 0p6b 32,768
Accounts Fireworks Models Qwen3 Embedding 4B 40,960

audio transcription 4

Model Input / 1M Output / 1M Context Caps
Accounts Fireworks Models Fireworks Asr Large 4,096
Accounts Fireworks Models Fireworks Asr v2 4,096
Accounts Fireworks Models Whisper v3 4,096
Accounts Fireworks Models Whisper V3 Turbo 4,096

rerank 3

Model Input / 1M Output / 1M Context Caps
Accounts Fireworks Models Qwen3 Reranker 0p6b 40,960
Accounts Fireworks Models Qwen3 Reranker 4B 40,960
Accounts Fireworks Models Qwen3 Reranker 8B 40,960

FAQ

How many Fireworks AI models are there?

268 Fireworks AI models are listed across 5 modalities on this page. 259 have public per-token pricing.

How is Fireworks AI pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which Fireworks AI model is cheapest?

Input pricing on Fireworks AI starts at $0.000130 per 1M tokens. Sort the table by price (or use the in-page filter at the top) to find the cheapest model that matches your capability requirements.

Can I route to Fireworks AI via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a Fireworks AI target, and call Fireworks AI models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any Fireworks AI model via Agent Command Center →
OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.