Choosing a Model
Framo runs many AI models. Every one is declared in the model registry
(lib/fal-ai/models), and each carries guide metadata — a speed badge
(fast/medium/slow), a quality badge (standard/high/ultra), and a
bestFor list. That registry is the source of truth for what exists and what
each model does. This doc is the routing layer: given a task, which family
to reach for. Deeper, model-specific prompting craft lives in the per-family
Model docs (Wan is the first — see Wan).
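To make the guide metadata concrete, here is a minimal sketch of how one entry might be typed. The badge values and the bestFor name come from this doc; the interface name, the id field, and the sample values are assumptions for illustration and are not copied from lib/fal-ai/models.

```typescript
// Hypothetical shape of a model's guide metadata (not the registry's actual type).
type Speed = "fast" | "medium" | "slow";
type Quality = "standard" | "high" | "ultra";

interface ModelGuide {
  id: string; // assumed identifier; the real registry key may differ
  speed: Speed;
  quality: Quality;
  bestFor: string[];
}

// Formats the badge pair the way this doc writes it, e.g. "fast/high".
function badge(guide: ModelGuide): string {
  return `${guide.speed}/${guide.quality}`;
}

// Illustrative entry; the id and bestFor values are made up for the example.
const exampleGuide: ModelGuide = {
  id: "nano-banana-2",
  speed: "fast",
  quality: "high",
  bestFor: ["everyday generation"],
};

const exampleBadge = badge(exampleGuide);
```

Restricting the badges to string-literal unions means a typo such as "quick" would be rejected at compile time, which suits metadata that other docs mirror.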
The axes that matter
Pick along four axes, in order:
- Modality: image, video, 3D, upscale/utility, or training. This narrows the field first.
- Input mode: text-to-image, image-to-image, multi-reference, or start+end frame. A model only accepts the modes in its compatibleModes.
- Speed vs quality: fast/high for iteration and drafts; ultra (often medium/slow) for finals, print, and hero shots.
- Cost: video and 3D scale with duration / resolution; the Generate button always shows the exact credit cost before you commit.
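The first three axes can be read as a filter followed by a ranking. The sketch below is one possible way to express that, not how Framo routes requests: the Candidate and Task types, the shortlist function, and the compatibleModes values in the sample data are assumptions. Cost is deliberately left out because the Generate button is the authority on credit cost.

```typescript
type Modality = "image" | "video" | "3d" | "utility" | "training";
type InputMode = "text-to-image" | "image-to-image" | "multi-reference" | "start-end-frame";
type Speed = "fast" | "medium" | "slow";
type Quality = "standard" | "high" | "ultra";

// Hypothetical, simplified view of a registry entry.
interface Candidate {
  name: string;
  modality: Modality;
  compatibleModes: InputMode[];
  speed: Speed;
  quality: Quality;
}

interface Task {
  modality: Modality;
  inputMode: InputMode;
  purpose: "draft" | "final";
}

const speedRank: Record<Speed, number> = { fast: 0, medium: 1, slow: 2 };
const qualityRank: Record<Quality, number> = { ultra: 0, high: 1, standard: 2 };

// Axis 1 and 2 filter; axis 3 orders what is left.
function shortlist(models: Candidate[], task: Task): Candidate[] {
  const matches = models
    .filter((m) => m.modality === task.modality)
    .filter((m) => m.compatibleModes.some((mode) => mode === task.inputMode));
  // Drafts prefer faster models; finals prefer higher quality.
  return matches.slice().sort((a, b) =>
    task.purpose === "draft"
      ? speedRank[a.speed] - speedRank[b.speed]
      : qualityRank[a.quality] - qualityRank[b.quality]
  );
}

// Badges follow this doc; the compatibleModes lists are illustrative guesses.
const sampleModels: Candidate[] = [
  { name: "Nano Banana 2", modality: "image", compatibleModes: ["text-to-image"], speed: "fast", quality: "high" },
  { name: "FLUX Ultra", modality: "image", compatibleModes: ["text-to-image"], speed: "medium", quality: "ultra" },
  { name: "Seedream v4 Edit", modality: "image", compatibleModes: ["image-to-image", "multi-reference"], speed: "medium", quality: "high" },
];

const draftOrder = shortlist(sampleModels, { modality: "image", inputMode: "text-to-image", purpose: "draft" });
const finalOrder = shortlist(sampleModels, { modality: "image", inputMode: "text-to-image", purpose: "final" });
```

With this sample data, a draft task ranks Nano Banana 2 first and a final task ranks FLUX Ultra first, while Seedream v4 Edit drops out because it does not list text-to-image.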
By task
Generate an image from text
- Nano Banana 2 (fast/high): Google's fast text-to-image with resolution control. A strong default for everyday generation. See Nano Banana.
- FLUX 2 (fast/high) and FLUX Schnell (fast/high): fast FLUX options for quick drafts and iteration. See FLUX.
- FLUX Ultra (medium/ultra) and FLUX Pro v1.1 (medium/ultra): reach here for finals (4× resolution, Raw mode for natural texture, large-format print). See FLUX.
- GPT Image 2 (medium/high): OpenAI's typography specialist; reach here when legible in-image text and strict prompt adherence matter. See GPT Image.
Edit or composite an existing image (no mask)
- Nano Banana Pro (medium/ultra): semantic multi-image editing, up to 14 input images, 1K/2K/4K. Best for character consistency and high-res compositing. See Nano Banana.
- Seedream v4 Edit (medium/high): natural-language editing, up to 10 input images; background replacement, style and material changes. See Seedream.
- Qwen Image Edit (medium/high): prompt-driven edits that work without a mask. See Qwen.
- GPT Image 2 Edit (medium/high): fine-grained edits with excellent text rendering, multi-image reference, and native mask support. See GPT Image.
Fill or inpaint a masked region
- FLUX Pro v1 Fill (medium/ultra): inpainting for masked areas. See FLUX.
- Z-Image Turbo Inpaint (fast/high): fast masked fill on the 6B turbo core, optionally LoRA-styled. See Z-Image.
- GPT Image 2 Edit (medium/high): native mask inpainting with strong text rendering. See GPT Image.
- The mask-injection variants (Nano Banana / Nano Banana Pro / Nano Banana 2) apply a semantic edit confined to a drawn selection. See Nano Banana and Image tools.
Animate a still (image-to-video)
- Wan 2.6 (medium/high): fast previews and multi-shot narrative from a single image (timestamped shot syntax). See Wan.
- Veo 3.1 Lite (medium/high): Google's high-quality image-to-video with native audio and lip-sync, single start frame. See Veo.
- Kling v3 Standard (medium/high): balanced; optional end frame and native audio. See Kling.
- Seedance 2 (medium/ultra): flagship cinematic motion; optional end frame and free synced audio. See Seedance.
Transition or compose video
- Smooth start→end: Kling v3 Standard or Seedance 2 (optional end frame); Veo 3.1 Lite FLF (medium/high) or Wan 2.1 FLF2V (medium/standard) for interpolation that requires both frames.
- From many references: Seedance 2 Reference (medium/ultra), which takes up to 9 reference images and 3 reference videos.
See Generating Videos and the image-to-video playbook.
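The end-frame distinction across the two video sections above can be summarized as a small lookup. This is a sketch under assumptions: the EndFrame type, the videoOptions table, and optionsFor are hypothetical names, and marking Wan 2.6 as taking no end frame is inferred from "from a single image" rather than confirmed by the registry.

```typescript
// How each video model treats an end frame, as described in this doc.
type EndFrame = "none" | "optional" | "required";

interface VideoOption {
  name: string;
  endFrame: EndFrame;
}

const videoOptions: VideoOption[] = [
  { name: "Wan 2.6", endFrame: "none" }, // assumption: inferred from "single image"
  { name: "Veo 3.1 Lite", endFrame: "none" }, // "single start frame"
  { name: "Kling v3 Standard", endFrame: "optional" },
  { name: "Seedance 2", endFrame: "optional" },
  { name: "Veo 3.1 Lite FLF", endFrame: "required" },
  { name: "Wan 2.1 FLF2V", endFrame: "required" },
];

// Returns the models that fit, given whether you have an end frame to supply.
function optionsFor(hasEndFrame: boolean): string[] {
  return videoOptions
    .filter((o) => (hasEndFrame ? o.endFrame !== "none" : o.endFrame !== "required"))
    .map((o) => o.name);
}

const withEndFrame = optionsFor(true);
const startOnly = optionsFor(false);
```

Kling v3 Standard and Seedance 2 appear in both results because their end frame is optional; the FLF variants only appear when both frames are available.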
Image to 3D model
- Trellis 2 (medium/high): high-quality 3D from a single image. See Trellis.
- Hyper3D Rodin v2 / v2.5 (slow/ultra): production-ready models from reference images. See Hyper3D Rodin.
- Hunyuan 3D Pro (slow/ultra): top-end quality. See Hunyuan 3D.
See the 3D scene editor.
Upscale and utility
- Topaz Upscale (fast/ultra): premium upscaling.
- Bria Background Remove (fast/high): commercial-safe background removal.
- Object Removal (fast/high): erase an object cleanly.
- Qwen Multiple Angles (medium/high): new camera angles of a subject. See Qwen.
Train a custom model
- FLUX LoRA Fast Training (slow/high): fast LoRA training for FLUX. See FLUX.
- Z-Image Trainer (medium/high): train a Z-Image Turbo LoRA. See Z-Image.
See LoRA training and the train-a-LoRA playbook.
Keeping this honest
The model list and the speed/quality/bestFor facts above are mirrored from
the registry — re-verify against lib/fal-ai/models when you bump the
timestamp. The registry is canonical for parameters, resolutions, durations,
and pricing; this doc never restates those numbers. Per-family prompting craft
(how to get the most out of each model) lives in Models.
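Re-verification could be partly automated with a small drift check. The sketch below is hypothetical: BadgeClaim and findDrift are illustrative names, and the registry snapshot is hand-written sample data (with one badge deliberately changed to trigger a report) rather than anything read from lib/fal-ai/models.

```typescript
interface BadgeClaim {
  name: string;
  speed: string;
  quality: string;
}

// Compares the badges this doc states against a registry snapshot and
// returns one message per model that is missing or has drifted.
function findDrift(docClaims: BadgeClaim[], registry: BadgeClaim[]): string[] {
  const problems: string[] = [];
  docClaims.forEach((claim) => {
    const actual = registry.filter((entry) => entry.name === claim.name)[0];
    if (!actual) {
      problems.push(`${claim.name}: not found in registry`);
    } else if (actual.speed !== claim.speed || actual.quality !== claim.quality) {
      problems.push(
        `${claim.name}: doc says ${claim.speed}/${claim.quality}, registry says ${actual.speed}/${actual.quality}`
      );
    }
  });
  return problems;
}

// Badges as written in this doc.
const docBadges: BadgeClaim[] = [
  { name: "Topaz Upscale", speed: "fast", quality: "ultra" },
  { name: "Trellis 2", speed: "medium", quality: "high" },
];

// Made-up snapshot: the Trellis 2 speed is altered on purpose for the example.
const registrySnapshot: BadgeClaim[] = [
  { name: "Topaz Upscale", speed: "fast", quality: "ultra" },
  { name: "Trellis 2", speed: "slow", quality: "high" },
];

const drift = findDrift(docBadges, registrySnapshot);
```

In this example drift holds a single message for Trellis 2; an empty array would mean the doc and the snapshot agree on every listed badge.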