thu-ml / Efficient-Diffusion-AlignmentLinks
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15Updated last year
Alternatives and similar repositories for Efficient-Diffusion-Alignment
Users that are interested in Efficient-Diffusion-Alignment are comparing it to the libraries listed below
Sorting:
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆50Updated 2 years ago
- ☆25Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆69Updated last year
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆161Updated 3 months ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆56Updated 10 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆103Updated 9 months ago
- Official implementation of "Self-Improving Video Generation"☆76Updated 8 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆87Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Updated 2 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Updated last year
- ☆158Updated 11 months ago
- DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆510Updated 3 months ago
- ☆16Updated last year
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆189Updated 6 months ago
- Official Implementation for Inference-time Scaling of Diffusion Models through Classical Search☆26Updated 2 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆175Updated 2 months ago
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆29Updated 2 months ago
- ☆129Updated 10 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- ☆57Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆44Updated 9 months ago
- ☆138Updated 5 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40Updated last year
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Updated last year
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆58Updated 7 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆240Updated last year
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆73Updated 7 months ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆242Updated last year
- A collection of paper/projects that trains flow matching model/policies via RL.☆329Updated 3 weeks ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆39Updated 2 years ago