[ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination
☆122Jan 27, 2026Updated 3 months ago
Alternatives and similar repositories for DDO
Users that are interested in DDO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆167Jan 31, 2025Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- Transition Models☆149May 11, 2026Updated 2 weeks ago
- Consistency Models Made Easy☆330Oct 13, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- EDM2 and Autoguidance -- Official PyTorch implementation☆843Dec 9, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- ☆169Apr 1, 2025Updated last year
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆862Feb 10, 2026Updated 3 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- ☆20Dec 8, 2024Updated last year
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆96Dec 4, 2025Updated 5 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆15Apr 22, 2026Updated last month
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆321May 29, 2025Updated 11 months ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated last year
- JoPano: Unified Panorama Generation via Joint Modeling☆24Mar 6, 2026Updated 2 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated last week
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆40Oct 26, 2025Updated 7 months ago
- Official implementation of Decoupled MeanFlow☆42Oct 28, 2025Updated 6 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆250Mar 11, 2025Updated last year
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆186Mar 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 8 months ago
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,469Dec 16, 2025Updated 5 months ago
- [ICML26] Distribution Matching Variational AutoEncoder (DMVAE)☆49Dec 9, 2025Updated 5 months ago
- Official implementation of Inductive Moment Matching☆585Jul 11, 2025Updated 10 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆131Jul 12, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 7 months ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 8 months ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,635Mar 16, 2025Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆98Nov 2, 2025Updated 6 months ago
- official training and inference code of bitwise tokenizer☆72May 18, 2025Updated last year
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆174Feb 18, 2025Updated last year
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆42Oct 29, 2025Updated 6 months ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)☆152Mar 29, 2025Updated last year
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆82Sep 8, 2025Updated 8 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆249Oct 12, 2025Updated 7 months ago