☆28 · Mar 4, 2025 · Updated last year
Alternatives and similar repositories for iv-vae
Users that are interested in iv-vae are comparing it to the libraries listed below
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model ☆53 · Oct 12, 2025 · Updated 4 months ago
- [NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models ☆31 · Jul 1, 2025 · Updated 8 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆198 · May 11, 2025 · Updated 9 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices ☆95 · Nov 30, 2025 · Updated 3 months ago
- This repo implements text-to-speech with a learnable audio encoder, without alignment with a transcript reference ☆53 · Sep 20, 2025 · Updated 5 months ago
- Pytorch implementation of Self-Refining Video Sampling ☆146 · Feb 6, 2026 · Updated 3 weeks ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation ☆20 · Feb 23, 2025 · Updated last year
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆29 · Feb 19, 2026 · Updated 2 weeks ago
- ☆20 · Jan 1, 2026 · Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆86 · Feb 27, 2025 · Updated last year
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling. ☆174 · Jun 26, 2025 · Updated 8 months ago
- ☆31 · Jul 16, 2025 · Updated 7 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models ☆286 · Dec 4, 2024 · Updated last year
- [NeurIPS2022] FreGAN: Exploiting Frequency Components for Training GANs under Limited Data ☆57 · Oct 17, 2022 · Updated 3 years ago
- CODA: Repurposing Continuous VAEs for Discrete Tokenization ☆35 · Jul 4, 2025 · Updated 8 months ago
- ☆37 · May 28, 2025 · Updated 9 months ago
- ☆33 · Jan 6, 2025 · Updated last year
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models" ☆40 · Oct 19, 2025 · Updated 4 months ago
- [CVPR 2023] GLeaD: Improving GANs with A Generator-Leading Task ☆32 · Jun 5, 2023 · Updated 2 years ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup ☆36 · Oct 15, 2025 · Updated 4 months ago
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026) ☆53 · Feb 23, 2026 · Updated last week
- DiT for VAE (and Video Generation) ☆35 · Sep 2, 2024 · Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆166 · Jan 31, 2025 · Updated last year
- [Preprint] UCGM: Unified Continuous Generative Models ☆182 · May 27, 2025 · Updated 9 months ago
- ACM MM 2023 ☆14 · Jul 15, 2025 · Updated 7 months ago
- Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models" ☆69 · Dec 9, 2025 · Updated 2 months ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization ☆36 · Dec 9, 2025 · Updated 2 months ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption" ☆38 · May 21, 2025 · Updated 9 months ago
- ☆10 · Sep 7, 2019 · Updated 6 years ago
- Official PyTorch implementation of FlowMo. ☆114 · Apr 7, 2025 · Updated 10 months ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion" ☆630 · Jul 1, 2025 · Updated 8 months ago
- [CVPR2025 Highlight] SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration ☆106 · Jun 18, 2025 · Updated 8 months ago
- Benchmark dataset and code of MSRVTT-Personalization ☆52 · Nov 10, 2025 · Updated 3 months ago
- Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 202… ☆148 · Sep 27, 2025 · Updated 5 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058). ☆420 · Aug 26, 2025 · Updated 6 months ago
- The `onnx` Python library (not `onnxruntime`, to be clear) running in the browser using Pyodide. ☆12 · Oct 12, 2023 · Updated 2 years ago
- ☆10 · Sep 24, 2024 · Updated last year
- ☆10 · Sep 17, 2022 · Updated 3 years ago
- A large-scale training and benchmarking framework for rPPG. ☆10 · Nov 26, 2024 · Updated last year