zhehuazhou / LLM_Reward_DesignLinks
☆9Updated last year
Alternatives and similar repositories for LLM_Reward_Design
Users that are interested in LLM_Reward_Design are comparing it to the libraries listed below
Sorting:
- ☆45Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆16Updated last month
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Updated 2 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆26Updated last month
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆23Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆16Updated 2 months ago
- ☆19Updated last year
- ☆10Updated last year
- [ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment☆13Updated 9 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆49Updated 4 months ago
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆19Updated 7 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆59Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆36Updated 2 months ago
- ☆27Updated 5 months ago
- Chain-of-Thought Predictive Control☆57Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆32Updated 7 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆48Updated last year
- ☆56Updated 11 months ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Updated 2 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆26Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆28Updated last year
- Code for Compositional Diffusion-Based Continuous Constraint Solvers (CoRL 23)☆58Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆14Updated last year
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent.☆19Updated 6 months ago
- MiniGrid Implementation of BEHAVIOR Tasks☆46Updated 9 months ago
- ☆29Updated last year
- PWM: Policy Learning with Large World Models☆49Updated 3 months ago
- ☆18Updated 9 months ago
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆21Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Updated 3 months ago