friolero / self_aligned_reward_learning
[ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment
☆17 · Updated last year
Alternatives and similar repositories for self_aligned_reward_learning
Users who are interested in self_aligned_reward_learning are comparing it to the repositories listed below.
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models ☆225 · Updated 2 years ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback ☆126 · Updated last year
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks ☆119 · Updated last year
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents ☆55 · Updated 8 months ago
- ☆298 · Updated 2 weeks ago
- [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction ☆101 · Updated last year
- ☆47 · Updated last year
- ☆77 · Updated 3 months ago
- https://arxiv.org/abs/2312.10807 ☆75 · Updated 11 months ago
- Coarse-to-fine Q-Network ☆54 · Updated last year
- Official implementation of paper on Nature Machine Intelligence: "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Le… ☆106 · Updated 7 months ago
- ☆156 · Updated last year
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024] ☆104 · Updated 11 months ago
- LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning ☆93 · Updated last year
- A list of awesome and popular robot learning environments ☆114 · Updated last year
- [CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation ☆213 · Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability" ☆53 · Updated last year
- Demo-Driven Mobile Bi-Manual Manipulation Benchmark. ☆198 · Updated 5 months ago
- ☆83 · Updated 2 years ago
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections" ☆46 · Updated last year
- A library of long-horizon Task-and-Motion-Planning (TAMP) problems in kitchen and household scenes, as well as planners to solve them ☆150 · Updated 5 months ago
- Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025) ☆124 · Updated 3 months ago
- ☆32 · Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models ☆20 · Updated last month
- ☆92 · Updated last year
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport ☆83 · Updated 2 years ago
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model" ☆102 · Updated last week
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion" ☆111 · Updated last year
- Official code release of AAAI 2024 paper SayCanPay. ☆50 · Updated 2 weeks ago
- SDP ☆71 · Updated last year