asonabend / ESRLLinks
Code for Expert Supervised Reinforcement Learning
☆10Updated 4 years ago
Alternatives and similar repositories for ESRL
Users that are interested in ESRL are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- ☆29Updated 3 years ago
- TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Le…☆16Updated 3 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- ☆15Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆37Updated 5 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- Robust Reinforcement Learning Benchmark☆10Updated 10 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆182Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Anti exploration in offline reinforcement learning☆9Updated 4 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 5 years ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆35Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Reinforcement Learning with Perturbed Reward, AAAI 2020☆29Updated last year
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆26Updated 5 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆17Updated 5 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆34Updated 2 years ago
- ☆42Updated 3 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆124Updated 8 months ago
- ☆15Updated 4 years ago