ZJLAB-AMMI / LLM4RL

An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM

☆78

Alternatives and similar repositories for LLM4RL

Users who are interested in LLM4RL are comparing it to the repositories listed below.

flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆268 · Updated 11 months ago
maohangyu / TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
☆57 · Updated last year
ZJLAB-AMMI / LLM4Teach
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with a Large Language Model
☆44 · Updated last year
yuqingd / ellm
☆81 · Updated last year
eric-ai-lab / llm_coordination
Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…
☆38 · Updated 9 months ago
devindeng94 / smac-hard
Enabling Mixed Opponent Strategy Script and Self-play on SMAC
☆33 · Updated last week
123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372 · Updated last year
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆77 · Updated 6 months ago
HosnLS / Hierarchical-Language-Agent
☆33 · Updated last year
PKU-Alignment / ProAgent
AAAI24 (Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models
☆89 · Updated 5 months ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163 · Updated last year
GuanSuns / LLMs-World-Models-for-Planning
The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…
☆99 · Updated 11 months ago
xlang-ai / text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
☆172 · Updated 7 months ago
minaek / reward_design_with_llms
☆220 · Updated 2 years ago
bic4907 / Overcooked-AI
Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method
☆40 · Updated 10 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆236 · Updated 9 months ago
haotiansun14 / AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
☆111 · Updated 4 months ago
srzer / LaMo-2023
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53 · Updated last year
Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆132 · Updated 2 years ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆281 · Updated 8 months ago
NJU-RL / Meta-DT
[NeurIPS 2024] Official Implementation of Meta-DT
☆45 · Updated 9 months ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆114 · Updated 3 years ago
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆140 · Updated last year
elicassion / StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆94 · Updated 2 years ago
pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆39 · Updated last year
BladeTransformerLLC / OvercookedGPT
An OpenAI gym environment to evaluate the ability of LLMs (e.g., GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…
☆69 · Updated 2 years ago
YaoMarkMu / Awesome-Pretrained-RL
☆89 · Updated 2 years ago
agentification / Language-Integrated-VI
☆19 · Updated last year
floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆276 · Updated last year
OpenRL-Lab / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent
☆61 · Updated last year