Claude Sonnet 4.6 vs GPT 5.2 Chat latest
Claude Sonnet 4.6 vs GPT 5.2 Chat latest: GPT 5.2 Chat latest is cheaper by 13%. Claude Sonnet 4.6 from Anthropic (1,000,000-token context) vs. GPT 5.2 Chat latest from OpenAI (128,000-token context). Use Agent Command Center to A/B both in shadow mode and pick the winner per workload.
Side-by-side cost
Live workload comparison
Same workload run through both models. The cheaper one is highlighted.
3,000
01,000,000
400
064,000
5,000
01,000,000
At this workload, GPT 5.2 Chat latest is 28% cheaper than Claude Sonnet 4.6 — a savings of $632/month ($7,579/year).
Production recipe — Agent Command Center
strategy: cost-optimized
primary:
model: gpt-5-2-chat-latest
provider: openai
fallback:
model: claude-sonnet-4-6
provider: anthropic
shadow: { sample_rate: 0.05 } # mirror 5% of traffic to compare quality live| Claude Sonnet 4.6 | GPT 5.2 Chat latest | |
|---|---|---|
| Input price | $3.00/M | $1.75/M |
| Output price | $15.00/M | $14.00/M |
| Context window | 1,000,000 | 128,000 |
| Max output | 64,000 | 16,384 |
| Function calling | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio input | — | — |
| Reasoning | ✓ | ✓ |
| Prompt caching | ✓ | ✓ |
| Structured output | ✓ | ✓ |
| Pricing verified | May 16, 2026 | May 16, 2026 |
Benchmark comparison
Side-by-side public benchmark scores. Greener bar = winner.
Chatbot Arena ELOgeneral
Claude Sonnet 4.6
1,466
GPT 5.2 Chat latest
1,477
GPQA Diamondreasoning
Claude Sonnet 4.6
89.9%
GPT 5.2 Chat latest
92.4%
MMLUgeneral
Claude Sonnet 4.6
89.3%
GPT 5.2 Chat latest
89.6%
SWE-bench Verifiedagent
Claude Sonnet 4.6
79.6%
GPT 5.2 Chat latest
80.0%
MMMU-Promultimodal⚠ different settings
Claude Sonnet 4.6
74.5%
GPT 5.2 Chat latest
79.5%
ARC-AGI-2reasoning⚠ different settings
Claude Sonnet 4.6
58.3%
GPT 5.2 Chat latest
52.9%
Humanity's Last Examreasoning⚠ different settings
Claude Sonnet 4.6
33.2%
GPT 5.2 Chat latest
34.5%