ZJLAB-AMMI / LLM4TeachLinks
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model
☆48Updated last year
Alternatives and similar repositories for LLM4Teach
Users that are interested in LLM4Teach are comparing it to the libraries listed below
Sorting:
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆35Updated last month
- [NeurIPS 2024] Official Implementation of Meta-DT☆45Updated 11 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆57Updated last year
- ☆83Updated 2 years ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆34Updated last year
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆33Updated last year
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆36Updated last year
- A collection of recent MARL papers☆95Updated 10 months ago
- [ICLR 2024] Official Implementation of ACORM☆59Updated last year
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆79Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆41Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Implementation of TWOSOME☆78Updated 8 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 9 months ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆39Updated 11 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆88Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- ☆31Updated 2 years ago
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)☆73Updated 3 years ago
- ☆93Updated 2 months ago
- ☆41Updated last year
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆22Updated 3 months ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆44Updated 10 months ago
- This is the official implementation of ERL-Re2.☆67Updated last year
- ☆114Updated 2 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆63Updated 2 years ago