Alescontrela / score_matching_rl
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
☆21Updated last month
Alternatives and similar repositories for score_matching_rl
Users that are interested in score_matching_rl are comparing it to the libraries listed below
Sorting:
- ☆28Updated last year
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆18Updated 11 months ago
- PWM: Policy Learning with Large World Models☆46Updated 2 months ago
- [RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation☆18Updated 11 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆27Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- Code for "DittoGym: Learning to Control Soft Shape-Shifting Robots" by Suning Huang, Boyuan Chen, Huazhe Xu, and Vincent Sitzmann.☆29Updated 2 weeks ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆31Updated 7 months ago
- ☆43Updated 5 months ago
- ☆44Updated last month
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆48Updated 4 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆25Updated 8 months ago
- Official implementation of DEMO3☆47Updated last month
- [NeurIPS'24] The Official PyTorch implementation of DRAIL☆35Updated 5 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆59Updated last year
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆48Updated 8 months ago
- ☆26Updated 11 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆70Updated last year
- ☆55Updated 10 months ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆46Updated last month
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆77Updated last year
- ☆32Updated last year
- (ICLR 2024) Reverse Forward Curriculum Learning☆47Updated 5 months ago
- ☆38Updated 9 months ago
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆11Updated 11 months ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆19Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆43Updated last year
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆30Updated 6 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆55Updated this week