PatrykChrabaszcz / Canonical_ES_Atari
Benchmarking Canonical Evolution Strategies for Playing Atari
☆81Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Canonical_ES_Atari
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 7 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- ☆117Updated 4 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 5 years ago
- ☆161Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆56Updated last year
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- NIPS 2017 Value Prediction Network☆166Updated 6 years ago
- ☆98Updated 8 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆96Updated 6 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Deep Attention Recurrent Q-Network☆116Updated 9 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆56Updated 8 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- ☆99Updated 8 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆126Updated 4 years ago
- This is my implementation of the Optimality Tightening☆37Updated 7 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 8 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 7 years ago
- ☆44Updated 5 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆178Updated 7 years ago
- Train an RL agent to play multiple Atari games at once☆71Updated 8 years ago
- Gym - Doom environments based on VizDoom.☆102Updated 7 years ago