ZJLAB-AMMI / LLM4TeachLinks
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model
☆52Updated last year
Alternatives and similar repositories for LLM4Teach
Users that are interested in LLM4Teach are comparing it to the libraries listed below
Sorting:
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated 2 years ago
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆79Updated last year
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆37Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆41Updated 6 months ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆46Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆53Updated last year
- ☆89Updated 2 years ago
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆44Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆42Updated last year
- [ICLR 2024] Official Implementation of ACORM☆63Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Updated 2 months ago
- ☆45Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆67Updated 2 years ago
- A collection of recent MARL papers☆104Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 3 years ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆38Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 4 years ago
- Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinf…☆15Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Updated last year
- This is the official implementation of ERL-Re2.☆73Updated last year
- Implementation of TWOSOME☆82Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Updated 2 years ago
- A large-scale multi-modal pre-trained model☆133Updated 2 years ago
- ☆16Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆148Updated 9 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆94Updated 2 years ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆62Updated 9 months ago
- ☆33Updated 2 years ago