Baidu Ernie 4.5 VL 28B A3b vs OpenAI GPT Oss 120B
Baidu Ernie 4.5 VL 28B A3b vs OpenAI GPT Oss 120B: Baidu Ernie 4.5 VL 28B A3b is cheaper by 7% on average. Baidu Ernie 4.5 VL 28B A3b from Novita AI (30,000-token context, reasoning, tool calls) vs. OpenAI GPT Oss 120B from Groq (131,072-token context, reasoning, tool calls). Use Agent Command Center to A/B both in shadow mode and pick the winner per workload.
Side-by-side cost
Live workload comparison
Same workload run through both models. The cheaper one is highlighted.
3,000
0131,072
400
032,766
5,000
01,000,000
At this workload, Baidu Ernie 4.5 VL 28B A3b is 7% cheaper than OpenAI GPT Oss 120B — a savings of $7.00/month ($84.01/year).
Production recipe — Agent Command Center
strategy: cost-optimized
primary:
model: baidu-ernie-4-5-vl-28b-a3b
provider: novita-ai
fallback:
model: openai-gpt-oss-120b
provider: groq
shadow: { sample_rate: 0.05 } # mirror 5% of traffic to compare quality live| Baidu Ernie 4.5 VL 28B A3b | OpenAI GPT Oss 120B | |
|---|---|---|
| Input price | $0.140/M | $0.150/M |
| Output price | $0.560/M | $0.600/M |
| Context window | 30,000 | 131,072 |
| Max output | 8,000 | 32,766 |
| Function calling | ✓ | ✓ |
| Vision | ✓ | — |
| Audio input | — | — |
| Reasoning | ✓ | ✓ |
| Prompt caching | — | — |
| Structured output | — | ✓ |
| Pricing verified | May 12, 2026 | May 12, 2026 |