asonabend / ESRLLinks
Code for Expert Supervised Reinforcement Learning
☆10Updated 4 years ago
Alternatives and similar repositories for ESRL
Users that are interested in ESRL are comparing it to the libraries listed below
Sorting:
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- ☆30Updated 3 years ago
- ☆18Updated 3 years ago
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.☆48Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆38Updated 7 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Anti exploration in offline reinforcement learning☆10Updated 4 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Updated 5 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆43Updated 4 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago
- ☆15Updated 5 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Updated 2 years ago
- ☆43Updated 4 years ago
- ☆57Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆40Updated last year
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 5 years ago
- ☆25Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- ☆131Updated last year
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆63Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆37Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆103Updated 3 years ago
- Offline Reinforcement Learning Reading Group☆28Updated 2 years ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆49Updated last year