Alescontrela / score_matching_rl
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
☆20Updated last week
Alternatives and similar repositories for score_matching_rl:
Users that are interested in score_matching_rl are comparing it to the libraries listed below
- ☆27Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆46Updated 7 months ago
- ☆42Updated 4 months ago
- PWM: Policy Learning with Large World Models☆43Updated 2 months ago
- [RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation☆17Updated 10 months ago
- Official repo for paper "TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint"☆52Updated 2 months ago
- ☆35Updated 8 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆53Updated 7 months ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆16Updated last year
- Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models☆29Updated 6 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆48Updated 3 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆58Updated last year
- Code for "DittoGym: Learning to Control Soft Shape-Shifting Robots" by Suning Huang, Boyuan Chen, Huazhe Xu, and Vincent Sitzmann.☆27Updated 11 months ago
- ☆22Updated 10 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆30Updated 6 months ago
- official implementation of QVPO☆32Updated 6 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆44Updated 4 months ago
- ☆42Updated 9 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆27Updated last year
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆19Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆25Updated 7 months ago
- ☆42Updated last month
- Official Code Repository for DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories☆10Updated 2 weeks ago
- ☆48Updated 2 months ago
- ☆31Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆13Updated last week
- ☆18Updated last year
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆27Updated 2 years ago