zhehuazhou / LLM_Reward_Design
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LLM_Reward_Design
- ☆45Updated 9 months ago
- Chain-of-Thought Predictive Control☆55Updated last year
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆50Updated last year
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆22Updated last year
- ☆17Updated 8 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- Code for the paper Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance, accepted to CoRL 2023 as an…☆22Updated 2 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆25Updated 6 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆36Updated this week
- MiniGrid Implementation of BEHAVIOR Tasks☆32Updated 3 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆18Updated 6 months ago
- PWM: Policy Learning with Large World Models☆37Updated 2 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆36Updated 7 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆36Updated last month
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent.☆11Updated last month
- [NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"☆48Updated last year
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆73Updated last year
- Code release for H-GAP Humanoid Control with a Generalist Planner☆18Updated 4 months ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆24Updated last month
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆14Updated last year
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆13Updated 7 months ago
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆23Updated 8 months ago
- [ICLR 2024] PyTorch Code for Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks☆74Updated 2 months ago
- ☆41Updated 4 months ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback☆49Updated 5 months ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆25Updated 5 months ago
- Interactive Fleet Learning Benchmark☆36Updated last year
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections"☆37Updated 4 months ago
- InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)☆27Updated 4 months ago