[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
☆407 · Jan 19, 2025 · Updated last year
Alternatives and similar repositories for VideoVAEPlus
Users that are interested in VideoVAEPlus are comparing it to the libraries listed below.
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆652 · Oct 29, 2025 · Updated 8 months ago
- A family of versatile and state-of-the-art video tokenizers. ☆451 · Sep 1, 2025 · Updated 9 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models ☆285 · Dec 4, 2024 · Updated last year
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆203 · May 11, 2025 · Updated last year
- A suite of image and video neural tokenizers ☆1,726 · Feb 11, 2025 · Updated last year
- Let's finetune video generation models! ☆550 · Sep 15, 2025 · Updated 9 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆1,012 · Nov 25, 2025 · Updated 7 months ago
- Scalable and memory-optimized training of diffusion models ☆1,358 · May 26, 2026 · Updated last month
- ☆216 · Feb 11, 2025 · Updated last year
- A unified inference and post-training framework for accelerated video generation. ☆3,768 · Updated this week
- [AAAI-2026] FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation ☆460 · Mar 5, 2025 · Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,576 · Apr 16, 2026 · Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆88 · Feb 27, 2025 · Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆1,498 · Dec 16, 2025 · Updated 6 months ago
- Official Implementation of VideoDPO ☆170 · Jun 1, 2025 · Updated last year
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control ☆821 · Jun 9, 2025 · Updated last year
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints ☆691 · May 23, 2025 · Updated last year
- [NeurIPS 2025] Improving Video Generation with Human Feedback ☆475 · Sep 24, 2025 · Updated 9 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆204 · Jan 7, 2026 · Updated 5 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆181 · Mar 18, 2026 · Updated 3 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models ☆1,376 · Aug 7, 2025 · Updated 10 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆219 · Sep 27, 2025 · Updated 9 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,827 · Nov 28, 2025 · Updated 7 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation ☆17 · Apr 3, 2024 · Updated 2 years ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆313 · Mar 12, 2025 · Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆167 · Jan 31, 2025 · Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆695 · Oct 25, 2024 · Updated last year
- [ICCV'25] DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion ☆1,332 · Oct 17, 2025 · Updated 8 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction" ☆308 · Apr 23, 2025 · Updated last year
- Next-Token Prediction is All You Need ☆2,423 · Jan 12, 2026 · Updated 5 months ago
- ☆131 · Jun 24, 2025 · Updated last year
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with… ☆168 · Aug 26, 2025 · Updated 10 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆251 · Oct 12, 2025 · Updated 8 months ago
- This repo contains the code for 1D tokenizer and generator ☆1,162 · Mar 20, 2025 · Updated last year
- MAGI-1: Autoregressive Video Generation at Scale ☆3,713 · Jun 17, 2026 · Updated last week
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆186 · Mar 20, 2025 · Updated last year
- The best OSS video generation models, created by Genmo ☆3,675 · Nov 14, 2025 · Updated 7 months ago
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis ☆1,567 · Dec 13, 2025 · Updated 6 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation ☆315 · Mar 7, 2026 · Updated 3 months ago