HippolyteBourel / UCRL_implementation
Various implementations and modifications of algorithms around UCRL.
☆10 Updated 5 years ago
Alternatives and similar repositories for UCRL_implementation:
Users who are interested in UCRL_implementation are comparing it to the repositories listed below:
- ☆28 Updated 8 months ago
- Gym environments modified with adversarial agents ☆35 Updated 7 years ago
- ☆41 Updated 3 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning ☆24 Updated 6 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL). ☆39 Updated 4 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆61 Updated 2 years ago
- ☆60 Updated 6 years ago
- ☆71 Updated 7 months ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment ☆174 Updated 7 years ago
- A3C style Option-Critic with deliberation cost ☆39 Updated 7 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago
- Learning Laplacian Representations in Reinforcement Learning ☆17 Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration" ☆12 Updated 4 years ago
- Model-Based Offline Reinforcement Learning ☆48 Updated 4 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆87 Updated 6 years ago
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information ☆16 Updated 5 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆46 Updated 2 years ago
- ☆65 Updated 10 months ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. ☆48 Updated 5 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch ☆23 Updated 4 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆20 Updated 6 years ago
- ☆191 Updated last year
- ☆54 Updated 11 months ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 Updated 6 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆42 Updated 4 months ago
- ☆28 Updated 2 years ago
- Offline Reinforcement Learning Reading Group ☆25 Updated 2 years ago
- ☆11 Updated 5 years ago
- Code to train RL agents along with adversarial disturbance agents ☆63 Updated 7 years ago