fal.ai models & pricing

fal.ai hosts 12 models (12 with public pricing) covering 1 modalities. Real-time inference for FLUX, Stable Diffusion, Whisper, and audio models. Cheapest input starts at free tier; the most premium goes up to enterprise tier. Use Future AGI's Agent Command Center to route any fal.ai model with cost-optimized fallback and unified observability.

Homepage ↗ Docs ↗

image generation 12

Model ↕	Input / 1M ↕	Output / 1M ↕	Blended ↕	Context ↕
Bria Text To Image 3.2	—	—	—	—
Fal AI Bytedance Dreamina V3.1 Text To Image	—	—	—	—
Fal AI Bytedance Seedream V3 Text To Image	—	—	—	—
Fal AI Flux Pro v1.1	—	—	—	—
Fal AI Flux Pro V1.1 Ultra	—	—	—	—
Fal AI Flux Schnell	—	—	—	—
Fal AI Ideogram v3	—	—	—	—
Fal AI Imagen4 preview	—	—	—	—
Fal AI Imagen4 preview Fast	—	—	—	—
Fal AI Imagen4 preview Ultra	—	—	—	—
Fal AI Recraft V3 Text To Image	—	—	—	—
Fal AI Stable Diffusion V35 Medium	—	—	—	—

FAQ

How many fal.ai models are there?

12 fal.ai models are listed across 1 modality on this page. 12 have public per-token pricing.

How is fal.ai pricing verified?

Pricing is aggregated from BerriAI/litellm, models.dev, and OpenRouter and refreshed weekly. Each row shows a per-model "verified" date. If a price is wrong, click the row to open the model page and use the inline "suggest edit" link — submissions go into a public review queue.

Which fal.ai model is cheapest?

Several fal.ai models are free or have public-pricing pending. Browse the rows above and submit a source if you spot one we're missing.

Can I route to fal.ai via an OpenAI-compatible API?

Yes — point your OpenAI client at Future AGI's Agent Command Center, configure a fal.ai target, and call fal.ai models with the standard /v1/chat/completions surface. The same gateway can route to other providers as fallback. Free for the first 100K requests/month.

Route any fal.ai model via Agent Command Center →

OpenAI-compatible endpoint. Caching, fallback, guardrails, observability. Free for 100K requests/month.