ZJLAB-AMMI / LLM4RLLinks
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
☆80Updated last year
Alternatives and similar repositories for LLM4RL
Users that are interested in LLM4RL are comparing it to the libraries listed below
Sorting:
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆273Updated last year
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆49Updated last year
- ☆85Updated 2 years ago
- ☆37Updated last year
- AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models☆90Updated 7 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆35Updated 2 months ago
- Implementation of TWOSOME☆80Updated 8 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆101Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- ☆219Updated 2 years ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆239Updated 2 weeks ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆377Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆57Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated last year
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆180Updated 9 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆273Updated 6 months ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆41Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆283Updated 10 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆70Updated 2 years ago
- A collection of LLM with RL papers☆278Updated last year
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆39Updated 11 months ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆120Updated 6 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- ☆14Updated last year
- Overcooked human-AI experiment platform☆38Updated last year
- ☆35Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 10 months ago