zhehuazhou / LLM_Reward_Design
☆9 Updated last year
Alternatives and similar repositories for LLM_Reward_Design:
Users who are interested in LLM_Reward_Design are comparing it to the repositories listed below:
- ☆45 Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement … ☆22 Updated 4 months ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James) ☆22 Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆26 Updated 2 years ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022 ☆14 Updated 2 years ago
- Interactive Fleet Learning Benchmark ☆36 Updated last year
- ☆24 Updated 2 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆35 Updated 11 months ago
- LLM multi-agent discussion framework for multi-agent/robot situations. ☆30 Updated 5 months ago
- Public release for "Distillation and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections" ☆44 Updated 9 months ago
- Official code release of AAAI 2024 paper SayCanPay. ☆45 Updated 11 months ago
- Author's PyTorch implementation of our ICLR 2024 paper "Uni-O4" ☆47 Updated 2 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners" ☆57 Updated last year
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent. ☆13 Updated 3 months ago
- ☆30 Updated last year
- Official PyTorch implementation of "ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization" ☆28 Updated 10 months ago
- ☆17 Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents. ☆17 Updated 5 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents ☆45 Updated last month
- ☆51 Updated 8 months ago
- PWM: Policy Learning with Large World Models ☆42 Updated last month
- Chain-of-Thought Predictive Control ☆56 Updated last year
- Papers, codes, datasets, applications, tutorials. ☆18 Updated this week
- ☆42 Updated 8 months ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆27 Updated 11 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models ☆52 Updated 5 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆41 Updated 11 months ago
- ☆42 Updated last month