GPT 5.2 vs GPT 5.2 (2025-12-11)

GPT 5.2 vs GPT 5.2 (2025-12-11): the two are priced identically, so neither is cheaper. Both come from OpenAI and have a 272,000-token context window. Use Agent Command Center to A/B both in shadow mode and pick the winner per workload.

Side-by-side cost

Live workload comparison

The same workload is run through both models.

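Input tokens per request: 3,000 (range 0 to 272,000)
Output tokens per request: 400 (range 0 to 128,000)
Requests per day: 5,000 (range 0 to 1,000,000)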
GPT 5.2 (OpenAI): $1,651/mo · Input $1.75/M · Output $14.00/M
GPT 5.2 (2025-12-11) (OpenAI): $1,651/mo · Input $1.75/M · Output $14.00/M
At this workload, GPT 5.2 and GPT 5.2 (2025-12-11) cost the same: the difference is $0 per month and $0 per year.
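The monthly figure can be reproduced with a short calculation. The sketch below assumes the three workload numbers are input tokens per request (3,000), output tokens per request (400), and requests per day (5,000), and that a month is counted as 365.25 / 12 days; the page does not state either assumption, but together they reproduce the $1,651/mo shown. The names `Pricing` and `monthly_cost` are illustrative only.

```python
from dataclasses import dataclass

# Assumption: average month length of about 30.44 days.
DAYS_PER_MONTH = 365.25 / 12


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD, as listed on the page."""
    input_per_million: float
    output_per_million: float


def monthly_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                 requests_per_day: int) -> float:
    """Estimate the monthly spend in USD for a fixed per-request workload."""
    per_request = (
        input_tokens * pricing.input_per_million
        + output_tokens * pricing.output_per_million
    ) / 1_000_000
    return per_request * requests_per_day * DAYS_PER_MONTH


# Both models list the same prices: $1.75/M input, $14.00/M output.
gpt_5_2 = Pricing(input_per_million=1.75, output_per_million=14.00)
gpt_5_2_dated = Pricing(input_per_million=1.75, output_per_million=14.00)

cost_a = monthly_cost(gpt_5_2, 3_000, 400, 5_000)
cost_b = monthly_cost(gpt_5_2_dated, 3_000, 400, 5_000)

print(f"${cost_a:,.0f}/mo vs ${cost_b:,.0f}/mo, difference ${cost_a - cost_b:.2f}")
```

Because the per-token prices are identical, the difference is zero for any workload, not only this one.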
Production recipe — Agent Command Center
strategy: cost-optimized
primary:
  model: gpt-5-2-2025-12-11
  provider: openai
fallback:
  model: gpt-5-2
  provider: openai
shadow: { sample_rate: 0.05 }   # mirror 5% of traffic to compare quality live
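To make the `sample_rate: 0.05` setting concrete, here is a minimal sketch of how a router might decide which requests to mirror. It is not Agent Command Center's implementation: the function names, the hash-based bucketing, and the choice to mirror to the fallback model are all assumptions made for illustration.

```python
import hashlib

SHADOW_SAMPLE_RATE = 0.05  # mirrors the `sample_rate` value in the recipe


def in_shadow_sample(request_id: str, rate: float = SHADOW_SAMPLE_RATE) -> bool:
    """Deterministically place a request in the shadow sample.

    Hashing the request ID (hypothetical field) maps it to a number in
    [0, 1); the request is mirrored when that number falls below `rate`.
    """
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return bucket < rate


def route(request_id: str) -> list[str]:
    """Return the models that should receive this request.

    Assumption: the primary model always serves the request, and sampled
    requests are additionally mirrored to the other model for comparison.
    """
    targets = ["gpt-5-2-2025-12-11"]
    if in_shadow_sample(request_id):
        targets.append("gpt-5-2")
    return targets


if __name__ == "__main__":
    mirrored = sum(in_shadow_sample(f"req-{i}") for i in range(10_000))
    print(f"mirrored {mirrored} of 10,000 requests")
```

Hashing rather than calling a random number generator keeps the decision stable for a given request ID, which makes mirrored requests easy to find again when comparing outputs.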
                    GPT 5.2         GPT 5.2 (2025-12-11)
Input price         $1.75/M         $1.75/M
Output price        $14.00/M        $14.00/M
Context window      272,000         272,000
Max output          128,000         128,000
Pricing verified    May 16, 2026    May 16, 2026

Capability flags compared: Function calling, Vision, Audio input, Reasoning, Prompt caching, Structured output
Cheaper option: neither (identical pricing)
Larger context: neither (272,000 tokens each)
More capabilities: 5 of 6 capability flags advertised

Benchmark comparison

Side-by-side public benchmark scores.

Benchmark (category)                 GPT 5.2    GPT 5.2 (2025-12-11)
Chatbot Arena ELO (general)          1,477      1,477
AIME 2025 (math)                     100.0%     not listed
GPQA Diamond (reasoning)             92.4%      not listed
MMLU (general)                       89.6%      not listed
ARC-AGI (reasoning)                  86.2%      not listed
SWE-bench Verified (agent)           80.0%      not listed
MMMU-Pro (multimodal)                79.5%      not listed
SWE-bench (agent)                    55.6%      not listed
ARC-AGI-2 (reasoning)                52.9%      not listed
FrontierMath (math)                  40.3%      not listed
Humanity's Last Exam (reasoning)     34.5%      not listed