Baidu Ernie 4.5 VL 28B A3b vs OpenAI GPT Oss 120B

Baidu Ernie 4.5 VL 28B A3b vs OpenAI GPT Oss 120B: Baidu Ernie 4.5 VL 28B A3b is cheaper by 7% on average. Baidu Ernie 4.5 VL 28B A3b from Novita AI (30,000-token context, reasoning, tool calls) vs. OpenAI GPT Oss 120B from Groq (131,072-token context, reasoning, tool calls). Use Agent Command Center to A/B both in shadow mode and pick the winner per workload.

Side-by-side cost

Live workload comparison

Same workload run through both models. The cheaper one is highlighted.

3,000
0131,072
400
032,766
5,000
01,000,000
Novita AI
$98.01/mo
Input $0.140/M · Output $0.560/M
Groq
$105/mo
Input $0.150/M · Output $0.600/M
At this workload, Baidu Ernie 4.5 VL 28B A3b is 7% cheaper than OpenAI GPT Oss 120B — a savings of $7.00/month ($84.01/year).
Production recipe — Agent Command Center
strategy: cost-optimized
primary:
  model: baidu-ernie-4-5-vl-28b-a3b
  provider: novita-ai
fallback:
  model: openai-gpt-oss-120b
  provider: groq
shadow: { sample_rate: 0.05 }   # mirror 5% of traffic to compare quality live
Baidu Ernie 4.5 VL 28B A3b OpenAI GPT Oss 120B
Input price $0.140/M $0.150/M
Output price $0.560/M $0.600/M
Context window 30,000 131,072
Max output 8,000 32,766
Function calling
Vision
Audio input
Reasoning
Prompt caching
Structured output
Pricing verified May 12, 2026 May 12, 2026
Cheaper option
~7% cheaper than OpenAI GPT Oss 120B
Larger context
131,072 tokens
More capabilities
3 of 6 capability flags advertised