ZJLAB-AMMI / LLM4Teach

Python code implementing LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with a Large Language Model

☆44

Alternatives and similar repositories for LLM4Teach

Users that are interested in LLM4Teach are comparing it to the libraries listed below

ZJLAB-AMMI / LLM4RL
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
☆78 Updated 11 months ago
devindeng94 / smac-hard
Enabling Mixed Opponent Strategy Script and Self-play on SMAC
☆33 Updated 2 weeks ago
NJU-RL / Meta-DT
[NeurIPS 2024] Official Implementation of Meta-DT
☆45 Updated 9 months ago
maohangyu / TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
☆57 Updated last year
liushunyu / CIA
[AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
☆35 Updated last year
yuqingd / ellm
☆81 Updated last year
NJU-RL / CuGRO
Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
☆31 Updated last year
thu-rllab / LESR
LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)
☆32 Updated last year
pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆39 Updated last year
chrisyrniu / Recent-Advances-in-Multi-Agent-Reinforcement-Learning
A collection of recent MARL papers
☆94 Updated 8 months ago
NJU-RL / ACORM
[ICLR 2024] Official Implementation of ACORM
☆58 Updated last year
jhejna / few-shot-preference-rl
☆35 Updated 2 years ago
liuqh16 / MAZero
Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.
☆34 Updated last year
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163 Updated last year
xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆29 Updated 7 months ago
PKU-RL / CLIP4MC
An RL-Friendly Vision-Language Model for Minecraft
☆33 Updated 9 months ago
AhmedMagdyHendawy / MOORE
Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024
☆21 Updated 8 months ago
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆77 Updated 6 months ago
OpenRL-Lab / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). A football game agent.
☆61 Updated last year
liushunyu / OPT
[TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
☆30 Updated last year
srzer / LaMo-2023
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53 Updated last year
HosnLS / Hierarchical-Language-Agent
☆33 Updated last year
NKAI-Decision-Team / LLM-PySC2
LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…
☆138 Updated 3 months ago
TimeBreaker / Adversarial-Reinforcement-Learning-Papers
Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)
☆71 Updated 2 years ago
bic4907 / Overcooked-AI
Offline Multi-Agent Reinforcement Learning Implementations: Solving the Overcooked Game with Data-Driven Methods
☆40 Updated 10 months ago
tyq1024 / RLx2
☆31 Updated 2 years ago
elicassion / StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆94 Updated 2 years ago
quantumiracle / MARS
MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.
☆49 Updated last year
eric-ai-lab / llm_coordination
Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…
☆38 Updated 9 months ago
Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆132 Updated 2 years ago