ZJLAB-AMMI / LLM4Teach
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model
☆25Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for LLM4Teach
- ☆67Updated last year
- Implementation of TWOSOME☆49Updated 6 months ago
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆63Updated 3 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆129Updated last month
- ☆15Updated 3 months ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆29Updated 5 months ago
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆151Updated last year
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆48Updated 7 months ago
- Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models"☆23Updated last month
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- ☆39Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆43Updated last year
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆23Updated 8 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆31Updated 7 months ago
- ☆26Updated last year
- Overcooked human-AI experiment platform☆30Updated 11 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- ☆32Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 10 months ago
- ☆53Updated last week
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆42Updated 3 weeks ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆46Updated 3 months ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆42Updated last year
- ☆12Updated 10 months ago
- Implementations of safe reinforcement learning algorithms☆21Updated 8 months ago
- ☆86Updated 2 years ago
- The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .☆55Updated 2 weeks ago