nuwuxian / RL-state_maskLinks

☆14

Alternatives and similar repositories for RL-state_mask

Users that are interested in RL-state_mask are comparing it to the libraries listed below

Sorting:

PKU-RL / CORRO
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆36Updated 3 years ago
rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
☆130Updated 3 years ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆115Updated 3 years ago
yihaosun1124 / pytorch-mopo
re-implementation of the offline model-based RL algorithm MOPO in pytorch
☆25Updated 3 years ago
lich14 / CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆85Updated 2 years ago
apexrl / GCRL-Collection
This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…
☆142Updated 2 years ago
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Updated 2 years ago
liyang619 / COLE-Platform
Overcooked human-AI experiment platform
☆38Updated last year
jhejna / inverse-preference-learning
☆43Updated 2 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆26Updated 2 years ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163Updated last year
benellis3 / pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆18Updated 2 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆148Updated 2 years ago
mantle2048 / rlplot
rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").
☆33Updated last year
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆108Updated last year
gwthomas / IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
☆86Updated 3 years ago
joonleesky / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆30Updated 5 years ago
Cranial-XIX / marl-copa
PyTorch Implementation of COPA for coordinating teams that can dynamically change.
☆21Updated 3 years ago
samjia2000 / HSP
This is a repository for Hidden-utility Self-Play.
☆25Updated 2 years ago
huanzhang12 / ATLA_robust_RL
Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework
☆66Updated 4 years ago
TonghanWang / RODE
Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …
☆78Updated 9 months ago
jidiai / GRF_MARL
Google Research Football MARL Benchmark and Research Toolkit
☆47Updated last year
young-geng / CQL
Conservative Q Learning on top of SAC
☆132Updated 2 years ago
namsan96 / SiMPL
☆48Updated 2 years ago
snu-mllab / DCPG
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
☆13Updated 2 years ago
awarelab / continual_world
☆102Updated last year
marc-rigter / rambo
Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022
☆30Updated 2 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆74Updated last year
AGI-Labs / continual_rl
Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…
☆125Updated 2 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆134Updated last year