Two multimodal systems, every generative job

One model handles generate, edit, image-to-image, and 5s or 10s video — not a menu of narrow endpoints. Priced at a fraction of fal.ai and the usual providers.

Odyssey-image$0.02 / generation

One image model for text-to-image, image-to-image, in-context edits, inpainting and multi-reference conditioning. No switching models, no re-learning the API.

  • Text-to-image and image-to-image in the same call
  • In-context edits, inpainting and outpainting with masks
  • Multi-reference conditioning for style and subject control
  • $0.02 per generation — typically 5–10× cheaper than fal.ai
Production ready · region-pinned · SOC 2

Inference that scales with your traffic

Purpose-built for generative image and video workloads — from the first prototype to millions of requests a day

$0.02 / IMAGE
$0.10 / VIDEO
5-10× CHEAPER
CUSTOM INFERENCE
$0.02 / IMAGE
$0.10 / VIDEO
5-10× CHEAPER
CUSTOM INFERENCE
$0.02 / IMAGE
$0.10 / VIDEO
5-10× CHEAPER
CUSTOM INFERENCE

Two multimodal systems. A tenth of the price.

Odyssey-image and Odyssey-1-video cover generation, editing, image-to-image and video in one API — at 5–10× lower cost than fal.ai and comparable providers

Odyssey-image

One multimodal image system: text-to-image, image-to-image, inpainting and in-context edits — $0.02 per generation

Odyssey-1-video

One multimodal video system: text-to-video and image-to-video, 5 or 10 second clips — $0.10 per generation

5–10× cheaper than fal.ai

Same-class quality at a fraction of the price — metered per generation with no minimums, no prepay and no idle fees

Unified endpoints

Switch between generate, edit and animate on the same model and the same auth — no re-integration, no vendor sprawl

Custom Inference

Ship your own checkpoints, LoRAs and pipelines on our autoscaling GPU fleet with private, region-pinned endpoints

Production API

Webhook callbacks, async queues, retries and per-request observability — the boring things that matter at scale

One API for every modality — one bill

Shipping in production, not demos

Teams ship image and video workloads on the VisionQ API — from marketing automation to creative SaaS

Marketing & Ad Creative

Spin up on-brand hero images and short video variants at a scale manual production can't match

Key Benefits

  • On-brand variants at scale
  • Reference-image conditioning
  • Custom LoRA per brand
  • Batch queues with webhooks
Perfect for marketing & ad creative

E-commerce Visuals

Generate product shots, lifestyle scenes and animated previews from a single reference photo

Key Benefits

  • Background replacement
  • Lifestyle scene generation
  • Animated product previews
  • Consistent product identity
Perfect for e-commerce visuals

Creative Tools & SaaS

Embed text-to-image and video generation inside your own editor without running your own GPUs

Key Benefits

  • Embeddable REST endpoints
  • Latency-budget friendly
  • Stream and async modes
  • Per-tenant rate limits
Perfect for creative tools & saas

Social & Short-form Video

Turn scripts or stills into scroll-stopping clips for TikTok, Reels and Shorts in seconds

Key Benefits

  • Text-to-video in seconds
  • Loopable outputs
  • 9:16 and 16:9 natively
  • Motion and camera prompts
Perfect for social & short-form video

Games & Interactive Media

Generate concept art, textures and cinematic cutscenes directly from your engine or pipeline

Key Benefits

  • Concept art and textures
  • Deterministic seeds
  • Image-to-image restyling
  • Pipeline-friendly CLI + SDK
Perfect for games & interactive media

Media & Post-production

Restyle footage, extend shots and upscale archives with deterministic, batch-friendly workflows

Key Benefits

  • Upscaling and restoration
  • Shot extension
  • Frame-sequence outputs
  • SOC 2 and region pinning
Perfect for media & post-production
Ready to move your generation pipeline to the cloud?

Trusted by ML teams in production

How engineering and creative teams ship generative image and video workloads on VisionQ

"We replaced three separate image and video vendors with a single VisionQ endpoint. Cold-start latency is measured in milliseconds and our render pipeline finally stopped being the bottleneck."

Sarah Chen

"Uploading our fine-tuned checkpoint and getting a private inference endpoint took one afternoon. The autoscaler just works — traffic spiked 40x during a campaign and p95 latency didn't blink."

Marcus Rodriguez

"The image-to-video API is the first one we've used that keeps subject consistency across frames. Our creative team now iterates on motion prompts instead of waiting overnight for renders."

Emily Watson

"Determinism matters when legal signs off on every asset. Seed control, queue retries and per-request logs mean we can reproduce any output six months later. That's rare in this space."

David Kim

Join the teams shipping generative media on VisionQ
5–10× cheaper than fal.ai on comparable workloads

Flat pricing per generation

No tokens, no megapixel math, no minimums. One price per image, one price per video.

Flat rate
Photo Gen

Odyssey-image

One multimodal image model: text-to-image, image-to-image, inpainting and in-context edits — in a single endpoint.

$0.02per generation
Flat price, any resolution up to 2K
Edits and image-to-image cost the same
  • Text-to-image + image-to-image
    One model, one price
  • In-context edits & inpainting
    Masks and prompts in one call
  • Multi-reference conditioning
    Style, subject and pose guidance
  • 5–10× cheaper than fal.ai
    Across comparable image workloads
Most Popular
Flat rate
Video Gen

Odyssey-1-video

One multimodal video model: text-to-video and image-to-video, 5 or 10 second clips — same price, one endpoint.

$0.10per generation
5s and 10s clips, 720p / 1080p
Text-to-video and image-to-video in one call
  • Text-to-video + image-to-video
    One model, one price
  • 5s and 10s clips
    24fps, 720p and 1080p
  • Motion & camera controls
    End-frame conditioning supported
  • 5–10× cheaper than fal.ai
    Benchmarked on equivalent clips
From
per GPU-hour, per second

Custom Inference

Deploy your own checkpoints, LoRAs or custom pipelines on the same autoscaling GPU fleet we run Odyssey on.

$1.80per GPU-hour
A100 40GB: $1.80 / hr, billed per second
H100 80GB: $3.40 / hr, billed per second
  • Bring your own model
    SDXL, Flux, SVD, WAN, fine-tunes
  • Autoscaling to zero
    No cold-start tax, no idle fee
  • Private endpoints
    VPC peering on enterprise plans
  • Region pinning
    EU, US, APAC data residency

Run the numbers yourself

New accounts start with $10 in credits — enough for 500 image generations or around 100 videos on the Odyssey models. Bring the same prompts you're running on fal.ai and compare side by side.

Frequently Asked Questions

Got Questions?

Everything developers and ML teams ask before they move production generation workloads to VisionQ

Still have questions?

Engineering handles support directly — benchmarks, custom model deployments and integration questions welcome.