thu-ml / Efficient-Diffusion-AlignmentLinks
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆13Updated 7 months ago
Alternatives and similar repositories for Efficient-Diffusion-Alignment
Users that are interested in Efficient-Diffusion-Alignment are comparing it to the libraries listed below
Sorting:
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆48Updated last year
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination☆47Updated this week
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆18Updated last week
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆38Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆31Updated last year
- ☆23Updated last year
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆40Updated 4 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆49Updated 3 weeks ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆64Updated 8 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆75Updated 3 months ago
- ☆52Updated 9 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆70Updated 7 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 7 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆56Updated 10 months ago
- ☆30Updated 2 months ago
- Official implementation of "Self-Improving Video Generation"☆66Updated 2 months ago
- ☆119Updated 4 months ago
- ☆152Updated last week
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆45Updated 2 weeks ago
- ☆21Updated 7 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆44Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated last month
- ☆37Updated last month
- Code release for paper "Test-Time Training Done Right"☆149Updated last week
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆33Updated 4 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆54Updated 7 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆134Updated last month
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆31Updated 10 months ago
- ☆54Updated 7 months ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆23Updated 2 months ago