NVlabs/DDO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVlabs/DDO)

NVlabs / DDO

[ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination

☆124

Alternatives and similar repositories for DDO

Users that are interested in DDO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mingyuanzhou / SiD
View on GitHub
PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)
☆155Mar 29, 2025Updated last year
tyshiwo1 / Awesome-Visual-Tokenizer
View on GitHub
Awesome Visual Tokenizers/Autoencoders
☆20Nov 19, 2025Updated 8 months ago
Shy-98 / MELLE
View on GitHub
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41Jun 28, 2025Updated last year
NVlabs / DiffusionNFT
View on GitHub
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆985Feb 10, 2026Updated 5 months ago
ali-vilab / FACM
View on GitHub
FACM: Flow-Anchored Consistency Models
☆147Aug 6, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
yinboc / dito
View on GitHub
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
☆169Jan 31, 2025Updated last year
RayYuki / CodecBench
View on GitHub
☆24Nov 16, 2025Updated 8 months ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
LINs-lab / UCGM
View on GitHub
[Preprint] UCGM: Unified Continuous Generative Models
☆185May 27, 2025Updated last year
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510Dec 16, 2025Updated 7 months ago
mingyuanzhou / SiD-LSG
View on GitHub
Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation
☆96Dec 4, 2025Updated 7 months ago
G-U-N / Awesome-Pixel-Flow
View on GitHub
☆38Dec 25, 2025Updated 7 months ago
NVlabs / HMAR
View on GitHub
[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
☆63Jul 8, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated 2 weeks ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
NVlabs / edm2
View on GitHub
EDM2 and Autoguidance -- Official PyTorch implementation
☆847Dec 9, 2024Updated last year
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 5 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
locuslab / ect
View on GitHub
Consistency Models Made Easy
☆333Oct 13, 2024Updated last year
X-LANCE / LSCodec-Inference
View on GitHub
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36Oct 23, 2025Updated 9 months ago
thu-ml / CCA
View on GitHub
Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"
☆37Feb 11, 2025Updated last year
thu-ml / GFT
View on GitHub
☆53Jun 13, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
WZDTHU / TiM
View on GitHub
Transition Models
☆156May 11, 2026Updated 2 months ago
OliverRensu / FlowAR
View on GitHub
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…
☆171May 1, 2025Updated last year
NVlabs / rcm
View on GitHub
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆772Jun 25, 2026Updated last month
thu-ml / Efficient-Diffusion-Alignment
View on GitHub
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15Oct 29, 2024Updated last year
Jiawei-Yang / FD-Loss
View on GitHub
☆547May 1, 2026Updated 2 months ago
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
icandle / GenDR
View on GitHub
GenDR: Lightning Generative Detail Restorator
☆38Feb 24, 2026Updated 5 months ago
End2End-Diffusion / REPA-E
View on GitHub
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511Dec 6, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
tang-bd / v-grpo
View on GitHub
[CVPR 2026 Findings] V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think
☆56Apr 28, 2026Updated 2 months ago
Lakonik / GMFlow
View on GitHub
[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)
☆194Nov 7, 2025Updated 8 months ago
yuhuUSTC / FAR
View on GitHub
Frequency Autoregressive Image Generation with Continuous Tokens
☆101Jun 9, 2025Updated last year
FoundationVision / BitVAE
View on GitHub
official training and inference code of bitwise tokenizer
☆71May 18, 2025Updated last year
kandinskylab / kvae-audio
View on GitHub
KVAE-Audio: a continuous full-band audio waveform autoencoder
☆101Updated this week
dc-ai-projects / DC-Gen
View on GitHub
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
☆398Updated this week