Together AI models & pricing

Together AI hosts 42 models (35 with public pricing) covering 2 modalities. Open-model inference cloud — Llama, DeepSeek, Qwen, Mixtral, FLUX with serverless and dedicated endpoints. Cheapest input starts at $0.008000/M tokens; the most premium goes up to $3.50/M. Use Future AGI's Agent Command Center to route any Together AI model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗
42

chat 39

Model Input / 1M Output / 1M Context Caps
OpenAI GPT Oss 20B $0.0500/M $0.200/M 128,000 tools
Together AI Up To 4B $0.1000/M $0.1000/M
OpenAI GPT Oss 120B $0.150/M $0.600/M 131,072 tools · reasoning
Qwen Qwen3 Next 80B A3b Instruct $0.150/M $1.50/M 262,144 tools
Qwen Qwen3 Next 80B A3b Thinking $0.150/M $1.50/M 262,144 tools
Meta Llama Llama 4 Scout 17B 16e Instruct $0.180/M $0.590/M tools
Meta Llama Meta Llama 3.1 8B Instruct Turbo $0.180/M $0.180/M tools
Qwen Qwen3.235b A22b Fp8 Tput $0.200/M $0.600/M 40,000
Qwen Qwen3.235b A22b Instruct 2507 Tput $0.200/M $6.00/M 262,000 tools
Together AI 4.1B 8B $0.200/M $0.200/M
Zai Org Glm 4.5 Air Fp8 $0.200/M $1.10/M 128,000 tools
Meta Llama Llama 4 Maverick 17B 128e Instruct Fp8 $0.270/M $0.850/M tools
Together AI 8.1B 21B $0.300/M $0.300/M 1,000
Zai Org Glm 4.7 $0.450/M $2.00/M 200,000 tools · reasoning
Moonshotai Kimi K2.5 $0.500/M $2.80/M 256,000 tools · vision · reasoning
DeepSeek AI DeepSeek R1.0528 Tput $0.550/M $2.19/M 128,000 tools
DeepSeek AI DeepSeek v3.1 $0.600/M $1.70/M 128,000 tools · reasoning
Mistralai Mixtral 8×7B Instruct v0.1 $0.600/M $0.600/M tools
Qwen Qwen3.5 397B A17b $0.600/M $3.60/M 262,144 tools
Zai Org Glm 4.6 $0.600/M $2.20/M 200,000 tools · reasoning
Qwen Qwen3.235b A22b Thinking 2507 $0.650/M $3.00/M 256,000 tools
Together AI 21.1B 41B $0.800/M $0.800/M
Meta Llama Llama 3.3 70B Instruct Turbo $0.880/M $0.880/M tools
Meta Llama Meta Llama 3.1 70B Instruct Turbo $0.880/M $0.880/M tools
Together AI 41.1B 80B $0.900/M $0.900/M
Moonshotai Kimi K2 Instruct $1.00/M $3.00/M tools
Moonshotai Kimi K2 Instruct 0905 $1.00/M $3.00/M 262,144 tools
DeepSeek AI DeepSeek v3 $1.25/M $1.25/M 65,536 tools
Together AI 81.1B 110B $1.80/M $1.80/M
Qwen Qwen3 Coder 480B A35b Instruct Fp8 $2.00/M $2.00/M 256,000 tools
DeepSeek AI DeepSeek R1 $3.00/M $7.00/M 128,000 tools
Meta Llama Meta Llama 3.1 405B Instruct Turbo $3.50/M $3.50/M tools
Meta Llama Llama 3.2 3B Instruct Turbo tools
Meta Llama Llama 3.3 70B Instruct Turbo Free tools
Mistralai Mistral 7B Instruct v0.1 tools
Mistralai Mistral Small 24B Instruct 2501 tools
Qwen Qwen2.5 72B Instruct Turbo tools
Qwen Qwen2.5 7B Instruct Turbo tools
Togethercomputer Codellama 34B Instruct tools

embedding 3

Model Input / 1M Output / 1M Context Caps
Baai Bge Base En v1.5 $0.008000/M 512
Together AI Embedding Up To 150M $0.008000/M
Together AI Embedding 151M To 350M $0.0160/M

FAQ

How many Together AI models are there?

42 Together AI models are listed across 2 modalities on this page. 35 have public per-token pricing.

How is Together AI pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which Together AI model is cheapest?

Input pricing on Together AI starts at $0.008000 per 1M tokens. Sort the table by price (or use the in-page filter at the top) to find the cheapest model that matches your capability requirements.

Can I route to Together AI via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a Together AI target, and call Together AI models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any Together AI model via Agent Command Center →
OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.