AILab-CVC/CV-VAE

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

☆285

Alternatives and similar repositories for CV-VAE

Users who are interested in CV-VAE are comparing it to the repositories listed below.

bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆1,167 · Mar 20, 2025 · Updated last year
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year
FoundationVision / OmniTokenizer
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
☆325 · Jul 9, 2024 · Updated 2 years ago
ssyang2020 / ZeroSmooth
☆66 · Jun 4, 2024 · Updated 2 years ago
snap-research / Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆700 · Oct 25, 2024 · Updated last year
baaivision / NOVA
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆656 · Oct 29, 2025 · Updated 8 months ago
Vchitect / Latte
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,946 · Oct 30, 2025 · Updated 8 months ago
VideoVerses / VideoVAEPlus
[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
☆409 · Jan 19, 2025 · Updated last year
Vchitect / VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
☆576 · Sep 16, 2024 · Updated last year
NJU-PCALab / OpenVid-1M
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
☆452 · May 30, 2025 · Updated last year
YingqingHe / ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
☆507 · Mar 7, 2024 · Updated 2 years ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 · Aug 15, 2024 · Updated last year
jjihwan / FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
☆486 · Oct 18, 2024 · Updated last year
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429 · Aug 25, 2025 · Updated 10 months ago
TianxingWu / FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
☆544 · Jan 18, 2024 · Updated 2 years ago
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018 · Nov 25, 2025 · Updated 8 months ago
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,026 · Aug 27, 2025 · Updated 10 months ago
magic-research / piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
☆538 · Sep 8, 2025 · Updated 10 months ago
hehao13 / CameraCtrl
☆657 · May 24, 2024 · Updated 2 years ago
MC-E / ReVideo
NeurIPS 2024
☆395 · Sep 26, 2024 · Updated last year
maxin-cn / Cinemo
[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models
☆296 · May 17, 2025 · Updated last year
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,300 · Oct 31, 2024 · Updated last year
aigc-apps / EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,268 · Mar 6, 2025 · Updated last year
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204 · Jan 7, 2026 · Updated 6 months ago
mira-space / Mira
☆361 · Oct 21, 2024 · Updated last year
NVIDIA / Cosmos-Tokenizer
A suite of image and video neural tokenizers
☆1,731 · Feb 11, 2025 · Updated last year
microsoft / Reducio-VAE
☆217 · Feb 11, 2025 · Updated last year
pixeli99 / SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
☆733 · Jul 25, 2024 · Updated 2 years ago
Vchitect / VBench
[CVPR 2024 Highlight] VBench - We Evaluate Video Generation
☆1,705 · Mar 23, 2026 · Updated 4 months ago
causalfusion / causalfusion
☆196 · Dec 17, 2024 · Updated last year
arthur-qiu / FreeTraj
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
☆114 · Sep 19, 2025 · Updated 10 months ago
Ji4chenLi / t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
☆312 · Jan 31, 2025 · Updated last year
TencentARC / MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
☆1,497 · Feb 19, 2025 · Updated last year
mihirp1998 / VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315 · Mar 12, 2025 · Updated last year
ShoufaChen / PixelFlow
Pixel-Space Generative Models
☆317 · May 11, 2025 · Updated last year
lmbxmu / CutDiffusion
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
☆27 · Oct 9, 2025 · Updated 9 months ago
google-research / magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002 · Jan 17, 2024 · Updated 2 years ago
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆765 · Dec 5, 2024 · Updated last year
hustvl / LightningDiT
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510 · Dec 16, 2025 · Updated 7 months ago