thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15Updated last year
Alternatives and similar repositories for Efficient-Diffusion-Alignment
Users who are interested in Efficient-Diffusion-Alignment are comparing it to the repositories listed below
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆50Updated 2 years ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆70Updated last year
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆78Updated 8 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆37Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆127Updated 11 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- ElasticTok: Adaptive Tokenization for Image and Video☆88Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Updated 2 years ago
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆59Updated 11 months ago
- DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆600Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆164Updated 4 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Updated last year
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆208Updated 3 months ago
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆27Updated last year
- Official implementation of "Self-Improving Video Generation"☆77Updated 9 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆45Updated 10 months ago
- [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆95Updated 3 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆83Updated 2 weeks ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40Updated last year
- Code for the paper "Training Diffusion Models with Reinforcement Learning"☆551Updated 2 years ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆207Updated last week
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆98Updated last year
- Official PyTorch implementation for Diffusion Rejection Sampling (DiffRS) in ICML 2024.☆22Updated last year
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆68Updated 4 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆207Updated 3 months ago
- A Video Tokenizer Evaluation Dataset☆150Updated last year