fal.ai models & pricing

fal.ai hosts 12 models (12 with public pricing) covering 1 modalities. Real-time inference for FLUX, Stable Diffusion, Whisper, and audio models. Cheapest input starts at free tier; the most premium goes up to enterprise tier. Use Future AGI's Agent Command Center to route any fal.ai model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗

image generation 12

FAQ

How many fal.ai models are there?

12 fal.ai models are listed across 1 modality on this page. 12 have public per-token pricing.

How is fal.ai pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which fal.ai model is cheapest?

Several fal.ai models are free or have public-pricing pending. Browse the rows above and submit a source if you spot one we're missing.

Can I route to fal.ai via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a fal.ai target, and call fal.ai models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any fal.ai model via Agent Command Center →
OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.