fal.ai models & pricing

fal.ai hosts 12 models (12 with public pricing) covering 1 modalities. Real-time inference for FLUX, Stable Diffusion, Whisper, and audio models. Cheapest input starts at free tier; the most premium goes up to enterprise tier. Use Future AGI's Agent Command Center to route any fal.ai model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗

image generation 12

Model Input / 1M Output / 1M Blended Context Caps
Bria Text To Image 3.2
Fal AI Bytedance Dreamina V3.1 Text To Image
Fal AI Bytedance Seedream V3 Text To Image
Fal AI Flux Pro v1.1
Fal AI Flux Pro V1.1 Ultra
Fal AI Flux Schnell
Fal AI Ideogram v3
Fal AI Imagen4 preview
Fal AI Imagen4 preview Fast
Fal AI Imagen4 preview Ultra
Fal AI Recraft V3 Text To Image
Fal AI Stable Diffusion V35 Medium

FAQ

How many fal.ai models are there?

12 fal.ai models are listed across 1 modality on this page. 12 have public per-token pricing.

How is fal.ai pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which fal.ai model is cheapest?

Several fal.ai models are free or have public-pricing pending. Browse the rows above and submit a source if you spot one we're missing.

Can I route to fal.ai via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a fal.ai target, and call fal.ai models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any fal.ai model via Agent Command Center →
OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.