Alescontrela / score_matching_rl
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
☆14Updated 2 months ago
Alternatives and similar repositories for score_matching_rl:
Users that are interested in score_matching_rl are comparing it to the libraries listed below
- [RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation☆17Updated 8 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆26Updated 4 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆46Updated 4 months ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆15Updated 10 months ago
- ☆13Updated 4 months ago
- Connect agent policies for distributed ML applications☆30Updated 4 months ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆17Updated 10 months ago
- ☆35Updated 3 weeks ago
- Code for "DittoGym: Learning to Control Soft Shape-Shifting Robots" by Suning Huang, Boyuan Chen, Huazhe Xu, and Vincent Sitzmann.☆26Updated 9 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆42Updated 2 months ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Updated 2 years ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated 11 months ago
- PWM: Policy Learning with Large World Models☆41Updated 6 months ago
- ☆21Updated 3 weeks ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆68Updated last month
- ☆55Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆43Updated last week
- ☆43Updated last year
- Coarse-to-fine Q-Network☆38Updated 6 months ago
- ☆42Updated 7 months ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆28Updated 2 months ago
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts☆19Updated 2 years ago
- ☆23Updated 3 years ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆45Updated last month
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated last year
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi…☆40Updated 8 months ago
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆77Updated last year
- ☆26Updated 11 months ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆62Updated last month