Stanford-ILIAD / Diverse-ConventionsLinks
Exploring techniques to generate diverse conventions in multi-agent settings
☆15Updated last year
Alternatives and similar repositories for Diverse-Conventions
Users that are interested in Diverse-Conventions are comparing it to the libraries listed below
Sorting:
- ☆32Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 9 months ago
- Official code repository for Prompt-DT.☆113Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆56Updated last year
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago
- ☆17Updated last year
- ☆12Updated last year
- ☆31Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 4 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆63Updated last year
- ☆48Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 11 months ago
- ☆14Updated 3 years ago
- ☆35Updated 2 years ago
- Change-Based Exploration Transfer☆35Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Updated 2 years ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆52Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆28Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆25Updated 2 years ago
- ☆89Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Updated 4 months ago
- ☆54Updated last year
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- ☆19Updated 2 years ago