chainer / chainerrl-visualizer
☆53Updated last year
Related projects ⓘ
Alternatives and complementary repositories for chainerrl-visualizer
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- ☆35Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆30Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Reinforcement learning algorithms in RLlib☆56Updated 6 months ago
- ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)☆44Updated 2 years ago
- Example implementation of Alpha Zero' s algotirhm on Jupyter notebook☆15Updated 4 years ago
- PyTorch code to train and evaluate Procgen tasks☆23Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- ☆19Updated 5 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆123Updated 5 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆20Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆31Updated 4 years ago