Model registry

See all available model APIs provided by fal.ai

Show
Stable Diffusion with LoRAs

Run Stable Diffusion with customizable LoRA weights.

text-to-image
inference
stylized
Stable Diffusion XL

Run SDXL at the speed of light

text-to-image
inference
Whisper

Whisper is a model for speech transcription and translation.

speech-to-text
inference
speech
Latent Consistency (SDXL & SDv1.5)

Produce high-quality images with minimal inference steps.

text-to-image
inference
Optimized Latent Consistency (SDv1.5)

Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.

image-to-image
inference
Comfy Workflow Executor

Execute Comfy workflows in fal.

json-to-image
inference
Fooocus

Default parameters with automated optimizations and quality improvements.

text-to-image
inference
stylized
Segment Anything Model

SAM.

image-to-image
inference
masks
AnimateDiff

Animate Your Texts!

text-to-video
inference
stylized
Illusion Diffusion

Create illusions conditioned on image.

text-to-image
inference
stylized
Midas Depth Estimation

Create depth maps using Midas depth estimation.

image-to-image
inference
utility
AnimateDiff Video to Video

Stylize your videos

video-to-video
inference
stylized
Remove Background

Remove the background from an image.

image-to-image
inference
utility
Upscale Images

Upscale images by a given factor.

image-to-image
inference
utility
ControlNet SDXL

Generate Images with ControlNet.

image-to-image
inference
Inpainting sdxl and sd

Inpaint images with SD and SDXL

image-to-image
inference
Animatediff LCM

Animate Your Texts with Latent Consistency Models!

text-to-image
inference
stylized
Stable Video Diffusion

Generate short video clips from your images.

image-to-image
inference
stylized