microsoft/Reducio-VAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/Reducio-VAE)

microsoft / Reducio-VAE

☆217

Alternatives and similar repositories for Reducio-VAE

Users that are interested in Reducio-VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / VidTok
View on GitHub
a family of versatile and state-of-the-art video tokenizers.
☆453Sep 1, 2025Updated 10 months ago
VideoVerses / VideoVAEPlus
View on GitHub
[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
☆409Jan 19, 2025Updated last year
hustvl / Turbo-VAED
View on GitHub
[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
☆131Jul 10, 2026Updated 2 weeks ago
baaivision / NOVA
View on GitHub
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆656Oct 29, 2025Updated 8 months ago
AILab-CVC / CV-VAE
View on GitHub
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
☆285Dec 4, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NVIDIA / Cosmos-Tokenizer
View on GitHub
A suite of image and video neural tokenizers
☆1,731Feb 11, 2025Updated last year
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,680Mar 16, 2025Updated last year
huang-yh / Owl
View on GitHub
☆52Dec 13, 2024Updated last year
ali-vilab / CDT
View on GitHub
Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
☆17Apr 2, 2025Updated last year
FoundationVision / Infinity
View on GitHub
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,579Apr 16, 2026Updated 3 months ago
Dawn-LX / CausalCache-VDM
View on GitHub
Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …
☆83May 22, 2025Updated last year
JIA-Lab-research / MagicMirror
View on GitHub
[ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
☆130Jun 26, 2025Updated last year
kwsong0113 / diffusion-forcing-transformer
View on GitHub
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
☆705Jul 1, 2025Updated last year
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018Nov 25, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wdrink / SimpleAR
View on GitHub
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431Jun 20, 2025Updated last year
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,943Feb 20, 2026Updated 5 months ago
TencentARC / Divot
View on GitHub
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87Feb 27, 2025Updated last year
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510Dec 16, 2025Updated 7 months ago
jy0205 / Pyramid-Flow
View on GitHub
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
☆3,199Dec 21, 2024Updated last year
Osilly / Interleaving-Reasoning-Generation
View on GitHub
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100Jan 26, 2026Updated 5 months ago
MiniMax-AI / VTP
View on GitHub
[ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation
☆495Apr 15, 2026Updated 3 months ago
VideoVerses / VideoTuna
View on GitHub
Let's finetune video generation models!
☆551Sep 15, 2025Updated 10 months ago
ant-research / Aurora
View on GitHub
Official implementation of Aurora
☆86Sep 20, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / metaquery
View on GitHub
Official Implementation of Paper Transfer between Modalities with MetaQueries
☆325Oct 12, 2025Updated 9 months ago
eai-lab / On-device-Sora
View on GitHub
[arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices
☆138Nov 27, 2025Updated 7 months ago
lxa9867 / ImageFolder
View on GitHub
High-performance Image Tokenizers for VAR and AR
☆307Apr 25, 2025Updated last year
alibaba-damo-academy / Lumos
View on GitHub
[ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆161Apr 6, 2026Updated 3 months ago
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
Hhhhhhao / continuous_tokenizer
View on GitHub
☆321May 29, 2025Updated last year
tianciB / VFM-VAE
View on GitHub
[CVPR 2026] VFM-VAE: Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
☆28Apr 23, 2026Updated 3 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,978Feb 25, 2026Updated 4 months ago
TencentARC / RollingForcing
View on GitHub
[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
☆444Oct 31, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SandAI-org / MAGI-1
View on GitHub
MAGI-1: Autoregressive Video Generation at Scale
☆3,746Jun 17, 2026Updated last month
ShoufaChen / PixelFlow
View on GitHub
Pixel-Space Generative Models
☆317May 11, 2025Updated last year
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,879Updated this week
mit-han-lab / efficientvit
View on GitHub
Efficient vision foundation models for high-resolution generation and perception.
☆3,332Sep 5, 2025Updated 10 months ago
zelaki / eqvae
View on GitHub
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆182Mar 18, 2026Updated 4 months ago
FoundationVision / FlashVideo
View on GitHub
[AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
☆460Mar 5, 2025Updated last year
tianweiy / CausVid
View on GitHub
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,408Aug 7, 2025Updated 11 months ago