LongCat-Video: Image to Video
Why Use LongCat-Video for Image to Video?
Identity & Style Preservation
Minutes‑Long Continuation
Efficient 720p @ 30fps
Unified 13.6B Architecture
Multi‑Reward RLHF (GRPO)
Creator‑Friendly Controls
LongCat‑Video Image to Video Pricing (Credits)
| Name & Role | Credits |
|---|---|
2s — 480p Quick SD preview | 8 |
2s — 720p Quick HD preview | 12 |
3s — 480p Short concept, SD | 12 |
3s — 720p Short concept, HD | 18 |
4s — 480p Preview, SD | 15 |
4s — 720p Preview, HD | 24 |
5s — 480p Social clip, SD | 19 |
5s — 720p Social clip, HD | 30 |
6s — 480p Extended shot, SD | 23 |
6s — 720p Extended shot, HD | 36 |
7s — 480p Longer take, SD | 27 |
7s — 720p Longer take, HD | 42 |
8s — 480p Long take, SD | 30 |
8s — 720p Long take, HD | 48 |
Frequently Asked Questions
What is Image to Video with LongCat‑Video?
Provide one image as the reference frame; the model animates it into a smooth, coherent clip while preserving subject identity, composition cues, and color consistency.
How does it stay consistent over time?
LongCat‑Video is pretrained for continuation and refined with multi‑reward RLHF (GRPO), improving temporal coherence, motion smoothness, and palette stability across longer shots.
How fast can I get results?
A coarse‑to‑fine schedule with Block Sparse Attention enables 720p 30fps generation in minutes on modern accelerators.
How are credits calculated?
Credits match the requested video seconds directly. Choose 480p (15fps) or 720p (30fps) for 2–8 second clips per the pricing table.
Prompt tips for Image to Video?
Specify subject action and pacing (e.g., "gentle head turn, slow camera push"), environment and lighting, and avoid conflicting instructions. For longer clips, outline beats succinctly.







