zhehuazhou / LLM_Reward_DesignLinks

☆9

Alternatives and similar repositories for LLM_Reward_Design

Users that are interested in LLM_Reward_Design are comparing it to the libraries listed below

Sorting:

Div99 / LISA
(NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation
☆31Updated 2 years ago
ademiadeniji / lamp
☆45Updated last year
labicon / CurricuLLM
Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
☆16Updated 3 months ago
amberxie88 / lapp
Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)
☆23Updated last year
wkh923 / m3pc
M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025
☆16Updated 4 months ago
cheryyunl / Make-An-Agent
☆45Updated 11 months ago
RishiHazra / saycanpay
Official code release of AAAI 2024 paper SayCanPay.
☆49Updated last year
StanfordVL / mini_behavior
MiniGrid Implementation of BEHAVIOR Tasks
☆47Updated 11 months ago
tan-liam / CableRouting
☆11Updated last year
ComputationalRobotics / TRAC
This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …
☆28Updated 2 months ago
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆53Updated 4 months ago
wang-kevin3290 / scaling-crl
☆47Updated 3 months ago
pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆38Updated last year
SeanJia / CoTPC
Chain-of-Thought Predictive Control
☆58Updated 2 years ago
UMass-Embodied-AGI / COMBO
Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"
☆38Updated 4 months ago
MaxSobolMark / OOO
Official repo for Offline RL for Online RL
☆17Updated last year
BerkeleyAutomation / ifl_benchmark
Interactive Fleet Learning Benchmark
☆36Updated 2 years ago
ahq1993 / compositional_reinforcement_learning
Deep reinforcement learning-basedskill transfer and composition method
☆9Updated 5 years ago
TUMcps / human-robot-gym
☆29Updated 6 months ago
FrankZheng2022 / PRISE
Codebase for PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem
☆24Updated last year
Lei-Kun / Uni-O4
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆51Updated 6 months ago
sukhijab / maxinforl_torch
☆44Updated 7 months ago
yongchao98 / NL2TL
Framework to transform natural language into formal language (Temporal Logics).
☆27Updated last year
vivekmyers / palo
Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"
☆29Updated 7 months ago
Toshihiro-Ota / decision-mamba
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
☆41Updated last year
AlignmentResearch / vlmrm
☆59Updated last year
seohongpark / CSD-locomotion
Controllability-Aware Unsupervised Skill Discovery (ICML 2023)
☆27Updated 2 years ago
graliuce / sgcrl
☆24Updated last month
anuragajay / hip
Codebase for HiP
☆90Updated last year
jianlanluo / SAQ
☆33Updated last month