chrodan / tdlearn
some common TD Learning algorithms
☆67Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for tdlearn
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- ☆42Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 7 years ago
- ☆66Updated 7 months ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆131Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- ☆98Updated 8 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- ☆99Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆65Updated 8 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- ☆26Updated 5 years ago
- ☆161Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆96Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 6 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆33Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- ☆69Updated 6 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 7 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 7 years ago