zhehuazhou / LLM_Reward_DesignLinks
☆9Updated last year
Alternatives and similar repositories for LLM_Reward_Design
Users that are interested in LLM_Reward_Design are comparing it to the libraries listed below
Sorting:
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆31Updated 2 years ago
- ☆45Updated last year
- Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models☆16Updated 3 months ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆23Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆16Updated 4 months ago
- ☆45Updated 11 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆49Updated last year
- MiniGrid Implementation of BEHAVIOR Tasks☆47Updated 11 months ago
- ☆11Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆28Updated 2 months ago
- PWM: Policy Learning with Large World Models☆53Updated 4 months ago
- ☆47Updated 3 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆38Updated last year
- Chain-of-Thought Predictive Control☆58Updated 2 years ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆38Updated 4 months ago
- Official repo for Offline RL for Online RL☆17Updated last year
- Interactive Fleet Learning Benchmark☆36Updated 2 years ago
- Deep reinforcement learning-basedskill transfer and composition method☆9Updated 5 years ago
- ☆29Updated 6 months ago
- Codebase for PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem☆24Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆51Updated 6 months ago
- ☆44Updated 7 months ago
- Framework to transform natural language into formal language (Temporal Logics).☆27Updated last year
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆29Updated 7 months ago
- Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces☆41Updated last year
- ☆59Updated last year
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆27Updated 2 years ago
- ☆24Updated last month
- Codebase for HiP☆90Updated last year
- ☆33Updated last month