zhehuazhou / LLM_Reward_Design
☆9Updated last year
Alternatives and similar repositories for LLM_Reward_Design:
Users that are interested in LLM_Reward_Design are comparing it to the libraries listed below
- ☆45Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆13Updated last month
- ☆42Updated 9 months ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆22Updated last year
- [ICML 2024] Learning Reward for Robot Skills Using Large Language Models via Self-Alignment☆11Updated 8 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆25Updated 6 months ago
- ☆9Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆26Updated 2 years ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 3 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆58Updated last year
- Interactive Fleet Learning Benchmark☆36Updated last year
- Code for Compositional Diffusion-Based Continuous Constraint Solvers (CoRL 23)☆56Updated last year
- ☆27Updated last year
- ☆38Updated 8 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆48Updated 3 months ago
- PWM: Policy Learning with Large World Models☆43Updated 2 months ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Updated 2 years ago
- ☆25Updated 3 months ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆29Updated 4 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆36Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆27Updated last year
- Code release for H-GAP Humanoid Control with a Generalist Planner☆24Updated 5 months ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆16Updated last year
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent.☆17Updated 5 months ago
- MiniGrid Implementation of BEHAVIOR Tasks☆44Updated 8 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆32Updated last month
- ☆53Updated 10 months ago
- ☆17Updated 8 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆46Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆13Updated last week