zai-org/SSVAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zai-org/SSVAE)

zai-org / SSVAE

official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".

☆71

Alternatives and similar repositories for SSVAE

Users that are interested in SSVAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ali-vilab / CDT
View on GitHub
Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
☆17Apr 2, 2025Updated last year
deepshwang / crepa
View on GitHub
☆15Jun 21, 2025Updated last year
RayYuki / CodecBench
View on GitHub
☆24Nov 16, 2025Updated 8 months ago
sii-research / GAE
View on GitHub
Official code of Geometric Autoencoder for Diffusion Models.
☆21Mar 12, 2026Updated 4 months ago
ali-vilab / iv-vae
View on GitHub
☆34Mar 4, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
dc-ai-projects / DC-VideoGen
View on GitHub
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
☆192Oct 5, 2025Updated 9 months ago
alibaba / OmniDoc-TokenBench
View on GitHub
☆69May 14, 2026Updated 2 months ago
guolinke / SphereAR
View on GitHub
Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive Generation"
☆104Feb 28, 2026Updated 4 months ago
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 5 months ago
End2End-Diffusion / REPA-E
View on GitHub
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511Dec 6, 2025Updated 7 months ago
tongdaxu / Making-rFID-Predictive-of-Diffusion-gFID
View on GitHub
Predicting the generation FID of latent diffusion, with a variant of reconstruction FID of Variational Auto-encoder.
☆84Jun 15, 2026Updated last month
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆29Mar 15, 2026Updated 4 months ago
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510Dec 16, 2025Updated 7 months ago
ZitengWangNYU / Scale-RAE
View on GitHub
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255Feb 13, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
multimodal-art-projection / Open-Suno
View on GitHub
trying to reproduce suno v3
☆34Jan 29, 2025Updated last year
KdaiP / DC-Speech-VAE
View on GitHub
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Nov 19, 2025Updated 8 months ago
End2End-Diffusion / diffusion-bench
View on GitHub
Towards Holistic evaluation of Generative Diffusion Transformers!
☆98Jul 1, 2026Updated 3 weeks ago
MiniMax-AI / VTP
View on GitHub
[ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation
☆495Apr 15, 2026Updated 3 months ago
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
snap-research / diffusability
View on GitHub
Source code for "Improving the Diffusability of Autoencoders" [ICML 2025]
☆21Jan 6, 2026Updated 6 months ago
XIANGLONGYAN / PBS2P
View on GitHub
PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"
☆13Jul 11, 2026Updated last week
zai-org / Kaleido
View on GitHub
Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …
☆144Mar 2, 2026Updated 4 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,978Feb 25, 2026Updated 4 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
OpenMOSS / claude-codex-handoff
View on GitHub
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30Jul 4, 2026Updated 3 weeks ago
HiDream-ai / SPM-Diff
View on GitHub
[ICLR 2025] Official lmplementation of SPM-Diff: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
☆48Mar 3, 2025Updated last year
TaatiTeam / Token-Perturbation-Guidance
View on GitHub
Official implementation of "Token Perturbation Guidance for Diffusion Models" [NeurIPS 2025]
☆17May 19, 2026Updated 2 months ago
thu-coai / VPO
View on GitHub
☆25Jul 20, 2025Updated last year
ShivamDuggal4 / adaptive-length-tokenizer
View on GitHub
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆146Feb 11, 2025Updated last year
zelaki / ReDi
View on GitHub
[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis
☆121Nov 3, 2025Updated 8 months ago
WeichenFan / UAE
View on GitHub
Official repo for UAE
☆207Jun 21, 2026Updated last month
apple / ml-sid-dit
View on GitHub
☆49Oct 29, 2025Updated 8 months ago
KexinHUANG19 / InstructTTSEval
View on GitHub
☆51Jun 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hustvl / Turbo-VAED
View on GitHub
[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
☆131Jul 10, 2026Updated 2 weeks ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,167Mar 20, 2025Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
SilentView / EMCID
View on GitHub
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
☆19Mar 21, 2024Updated 2 years ago
vita-epfl / RDM
View on GitHub
☆79Jul 3, 2026Updated 3 weeks ago
NJU-PCALab / LUVE
View on GitHub
[ICML 2026] LUVE : Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
☆18May 11, 2026Updated 2 months ago
csguoh / DummyForcing
View on GitHub
Minute-long video generation at 24FPS.
☆69Mar 28, 2026Updated 3 months ago