HippolyteBourel / UCRL_implementation
Various implementations and modification of algorithm around UCRL.
☆10Updated 4 years ago
Related projects: ⓘ
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆25Updated 6 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- Soft Actor-Critic☆142Updated 6 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆158Updated 4 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆167Updated 7 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆86Updated 6 years ago
- ☆62Updated 4 years ago
- ☆59Updated 6 years ago
- ☆90Updated 9 months ago
- Hindsight policy gradients☆42Updated 4 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆141Updated last year
- Offline Reinforcement Learning Reading Group☆24Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆88Updated 3 weeks ago
- ☆187Updated last year
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- Gym environments modified with adversarial agents☆35Updated 7 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆37Updated last year
- ☆42Updated 7 years ago
- ☆41Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- ☆28Updated 3 months ago
- ☆107Updated last year
- Code to train RL agents along with Adversarial distrubance agents☆63Updated 7 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- Hierarchical Deep RL Network☆29Updated 7 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 5 years ago
- Simple maze environments using mujoco-py☆52Updated 8 months ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19Updated 5 years ago
- ☆95Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information☆16Updated 4 years ago