AI Text to Video Generator

LongCat-Video AI Video Generator

Create coherent, minutes-long videos from simple prompts. LongCat-Video unifies Text-to-Video, Image-to-Video, and Video-Continuation with efficient 720p/30fps generation, stable color consistency, and modern RLHF tuning for reliable, cinematic outputs.
Model Highlights

Why Choose LongCat-Video?

A unified, efficient, and reliable long video generator built for modern creative workflows — from social clips to minutes-long narratives.

Unified Multi‑Task Pipeline

One 13.6B‑parameter model for Text‑to‑Video, Image‑to‑Video, and Video‑Continuation. Keep style and motion consistent across clips while switching tasks without model hopping.

Minutes‑Long Continuation

Pretraining on the video‑continuation task enables minutes‑long outputs with stable composition, reduced color drift, and smooth scene evolution, which suits storytelling and product demos.

Efficient 720p @ 30fps

Coarse‑to‑fine generation across temporal and spatial axes produces 720p, 30fps videos in minutes. Block Sparse Attention accelerates high‑resolution inference without sacrificing detail.

RLHF with Multi‑Reward GRPO

Multi‑reward RLHF (GRPO) aligns outputs to human preferences across motion quality, temporal coherence, and visual fidelity — delivering competitive results vs. top open‑source and commercial systems.
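As a rough illustration of the group‑relative idea behind GRPO, the sketch below scores a group of candidate videos generated for one prompt, folds several reward scores into a single value, and normalizes that value within the group. The reward names mirror the criteria listed above. The equal weights, the weighted‑sum combination, and the function name `group_relative_advantages` are assumptions made for illustration, not details of LongCat‑Video's actual training recipe.

```python
from statistics import mean, pstdev

# Hypothetical weights: the source does not say how the rewards are combined.
WEIGHTS = {"motion_quality": 1.0, "temporal_coherence": 1.0, "visual_fidelity": 1.0}


def group_relative_advantages(group, weights=WEIGHTS, eps=1e-8):
    """Return one advantage per candidate, normalized within the group."""
    # Collapse each candidate's reward scores into one weighted total.
    totals = [sum(weights[name] * scores[name] for name in weights) for scores in group]
    baseline = mean(totals)   # group mean acts as the baseline
    spread = pstdev(totals)   # group standard deviation rescales the advantage
    return [(total - baseline) / (spread + eps) for total in totals]


# Made-up scores for three candidates sampled from the same prompt.
group = [
    {"motion_quality": 0.9, "temporal_coherence": 0.8, "visual_fidelity": 0.7},
    {"motion_quality": 0.4, "temporal_coherence": 0.5, "visual_fidelity": 0.6},
    {"motion_quality": 0.6, "temporal_coherence": 0.6, "visual_fidelity": 0.6},
]
advantages = group_relative_advantages(group)
```

Candidates that score above the group average receive a positive advantage and those below it a negative one, so no separate value model is needed to provide a baseline.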

Consistent Colors & Motion

LongCat‑Video maintains stable palettes and temporal consistency across long sequences, minimizing flicker and drift for professional‑grade edits and post pipelines.

Creator‑Friendly Controls

Natural‑language prompts guide subjects, environments, and pacing. Choose aspect ratios for landscape, portrait, or square delivery to match your platform strategy.
Transparent Pricing

LongCat‑Video Pricing (Credits)

Credits scale with the requested duration and resolution: 720p costs 6 credits per second, and 480p costs about 4 credits per second. Select 480p (15fps) for budget runs or 720p (30fps) for sharper delivery.
Duration  Resolution  Use case            Credits
2s        480p        Quick preview, SD   8
2s        720p        Quick preview, HD   12
3s        480p        Short concept, SD   12
3s        720p        Short concept, HD   18
4s        480p        Preview, SD         15
4s        720p        Preview, HD         24
5s        480p        Social clip, SD     19
5s        720p        Social clip, HD     30
6s        480p        Extended shot, SD   23
6s        720p        Extended shot, HD   36
7s        480p        Longer take, SD     27
7s        720p        Longer take, HD     42
8s        480p        Long take, SD       30
8s        720p        Long take, HD       48
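
For budgeting, the table above can be treated as a simple lookup. The sketch below copies the listed credit values into a dictionary and adds a frame count based on the stated frame rates (15fps at 480p, 30fps at 720p). The function names are hypothetical and this is not an official API. The listed values happen to match 6 credits per second at 720p and 3.75 credits per second rounded up at 480p, but that formula is an observation from the table rather than a published rule.

```python
import math

# Credit values copied from the pricing table, keyed by (seconds, resolution).
CREDITS = {
    (2, "480p"): 8,  (2, "720p"): 12,
    (3, "480p"): 12, (3, "720p"): 18,
    (4, "480p"): 15, (4, "720p"): 24,
    (5, "480p"): 19, (5, "720p"): 30,
    (6, "480p"): 23, (6, "720p"): 36,
    (7, "480p"): 27, (7, "720p"): 42,
    (8, "480p"): 30, (8, "720p"): 48,
}

# Frame rates stated on the page for each resolution.
FPS = {"480p": 15, "720p": 30}


def credits_for(seconds: int, resolution: str) -> int:
    """Look up the credit cost for a listed duration and resolution."""
    if (seconds, resolution) not in CREDITS:
        raise ValueError(f"no listed price for {seconds}s at {resolution}")
    return CREDITS[(seconds, resolution)]


def frame_count(seconds: int, resolution: str) -> int:
    """Number of frames in a clip at the stated frame rate."""
    return seconds * FPS[resolution]


def observed_formula(seconds: int, resolution: str) -> int:
    """Pattern inferred from the table (an assumption, not a published rule)."""
    if resolution == "720p":
        return 6 * seconds
    return math.ceil(15 * seconds / 4)  # 3.75 credits per second, rounded up
```

For example, `credits_for(5, "720p")` returns 30 for a clip of `frame_count(5, "720p")` = 150 frames.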
FAQ

Frequently Asked Questions


What is LongCat‑Video?

LongCat‑Video is a 13.6B‑parameter foundation model for Text‑to‑Video, Image‑to‑Video, and Video‑Continuation. It’s optimized for efficient, high‑quality long video generation with stable colors and temporal coherence.

How fast is inference?

Thanks to a coarse‑to‑fine schedule across time and space plus Block Sparse Attention, LongCat‑Video can produce 720p, 30fps videos within minutes on modern accelerators.

How does pricing work?

Credits depend on the requested duration and resolution. Choose 480p (15fps) or 720p (30fps) and a length of 2 to 8 seconds: 720p costs 6 credits per second and 480p costs about 4 credits per second, as listed in the pricing table.

What makes outputs feel consistent?

The model is natively pretrained for continuation and refined with multi‑reward RLHF (GRPO), improving motion smoothness, color stability, and narrative coherence over longer shots.

Any tips for prompts?

Keep subjects and actions specific, and add environment cues and pacing words (e.g., "slow camera push", "daylight soft shadows"). For longer clips, outline the beats in one sentence each.
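
One way to apply these tips consistently is to assemble prompts from named parts. The helper below is a minimal sketch: `build_prompt` and its parameters are hypothetical and not part of any LongCat‑Video interface. It simply joins a subject, an action, optional environment and pacing cues, and one sentence per beat into a single prompt string.

```python
def build_prompt(subject, action, environment="", pacing="", beats=()):
    """Join prompt parts into one string, with one sentence per beat."""
    opening = f"{subject} {action}".strip()
    # Optional cues are appended to the opening sentence, separated by commas.
    cues = [cue for cue in (environment, pacing) if cue]
    first_sentence = ", ".join([opening] + cues) + "."
    # Each beat becomes its own sentence with exactly one trailing period.
    beat_sentences = [b.strip().rstrip(".") + "." for b in beats if b.strip()]
    return " ".join([first_sentence] + beat_sentences)


prompt = build_prompt(
    "A ginger cat",
    "walks along a windowsill",
    environment="daylight soft shadows",
    pacing="slow camera push",
    beats=["The cat pauses to look outside", "It jumps down onto a sofa"],
)
```

Here `prompt` reads: "A ginger cat walks along a windowsill, daylight soft shadows, slow camera push. The cat pauses to look outside. It jumps down onto a sofa."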
