NVIDIA Research unveils LongLive-2.0 low-precision system for long video generation

Details

NVIDIA Research introduced LongLive-2.0, an end-to-end NVFP4 training and inference system for long video generation.
The system targets the systems-level challenges of generating long video, emphasizing efficient low-precision (4-bit) deployment.
LongLive-2.0 is designed to avoid the typical mismatch between full-precision training and post-training quantization used for deployment.
It generates long 720p video sequences while preserving subject and background consistency across multi-shot scenes.
The model supports prompt switching at chunk boundaries, enabling multi-shot narratives with evolving text descriptions.
NVIDIA is releasing the full project resources, including research paper, code, pretrained models, and demos, via an official project page.
The work builds on NVIDIA’s broader push into video generative AI, aligning model design, numeric formats, and hardware for better efficiency.

Impact

By tightly coupling NVFP4 training and inference, LongLive-2.0 advances the efficiency of high-resolution video generation at scale, directly addressing a key bottleneck for deploying generative video models on NVIDIA GPUs. This raises the bar for rivals working on long-form video, such as OpenAI and Google, and could accelerate real-world adoption in media, advertising, and simulation where cost-effective, consistent multi-shot video is essential.

NVIDIA Research unveils LongLive-2.0 low-precision system for long video generation

Details

Impact

Social

CONTENT

INFO