nuwuxian / RL-state_mask
☆14 Updated last year
Alternatives and similar repositories for RL-state_mask
Users who are interested in RL-state_mask are comparing it to the repositories listed below.
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆36 Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆130 Updated 3 years ago
- Official code repository for Prompt-DT. ☆115 Updated 3 years ago
- Re-implementation of the offline model-based RL algorithm MOPO in PyTorch ☆25 Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆142 Updated 2 years ago
- Official codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 Updated 2 years ago
- Overcooked human-AI experiment platform ☆38 Updated last year
- ☆43 Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR 2023 Accepted) ☆163 Updated last year
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios) ☆18 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆148 Updated 2 years ago
- rlplot is an easy-to-use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable"). ☆33 Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆108 Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆86 Updated 3 years ago
- PyTorch implementation of OpenAI's Procgen ppo-baseline, built from scratch. ☆30 Updated 5 years ago
- PyTorch implementation of COPA for coordinating teams that can dynamically change. ☆21 Updated 3 years ago
- This is a repository for Hidden-utility Self-Play. ☆25 Updated 2 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework ☆66 Updated 4 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is … ☆78 Updated 9 months ago
- Google Research Football MARL Benchmark and Research Toolkit ☆47 Updated last year
- Conservative Q Learning on top of SAC