[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
☆392Jan 19, 2025Updated last year
Alternatives and similar repositories for VideoVAEPlus
Users that are interested in VideoVAEPlus are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆627Oct 29, 2025Updated 4 months ago
- a family of versatile and state-of-the-art video tokenizers.☆435Sep 1, 2025Updated 5 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆194May 11, 2025Updated 9 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆996Nov 25, 2025Updated 3 months ago
- A suite of image and video neural tokenizers☆1,707Feb 11, 2025Updated last year
- Let's finetune video generation models!☆543Sep 15, 2025Updated 5 months ago
- Scalable and memory-optimized training of diffusion models☆1,338Jun 4, 2025Updated 8 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,093Updated this week
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆457Mar 5, 2025Updated 11 months ago
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,546Nov 10, 2025Updated 3 months ago
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,398Dec 16, 2025Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Feb 27, 2025Updated last year
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆428Sep 24, 2025Updated 5 months ago
- ☆213Feb 11, 2025Updated last year
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆807Jun 9, 2025Updated 8 months ago
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 8 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,746Nov 28, 2025Updated 3 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints☆679May 23, 2025Updated 9 months ago
- [ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,327Oct 17, 2025Updated 4 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆674Oct 25, 2024Updated last year
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆301Apr 23, 2025Updated 10 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆307Mar 12, 2025Updated 11 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆183Mar 20, 2025Updated 11 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,220Aug 7, 2025Updated 6 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆174Jun 26, 2025Updated 8 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆215Sep 27, 2025Updated 5 months ago
- [CVPR'25 Highlight] You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale☆708Apr 16, 2025Updated 10 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆306Jun 29, 2025Updated 8 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆321Mar 30, 2025Updated 11 months ago
- Next-Token Prediction is All You Need☆2,350Jan 12, 2026Updated last month
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆115Nov 26, 2024Updated last year
- MAGI-1: Autoregressive Video Generation at Scale☆3,643Jun 17, 2025Updated 8 months ago
- ☆414Mar 10, 2025Updated 11 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆244Oct 12, 2025Updated 4 months ago
- [CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis☆158Apr 15, 2025Updated 10 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆161Aug 26, 2025Updated 6 months ago