fomorians / td-gammon
Implementation of TD-Gammon in TensorFlow.
☆110Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for td-gammon
- Deep Reinforcement Learning library for humans☆301Updated 7 years ago
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Updated 7 years ago
- ☆99Updated 8 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- tensorflow reinforcement learning agents for OpenAI gym environments☆110Updated 7 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆126Updated 4 years ago
- KEras Reinforcement Learning gYM agents☆292Updated 7 years ago
- Basic DQN implementation☆220Updated 6 years ago
- Some Reinforcement Learning in Python☆117Updated 7 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆38Updated 7 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- This is the 0.4 release of the Arcade Learning Environment (ALE), a platform designed for AI research. ALE is based on Stella, an Atari 2…☆160Updated 7 years ago
- RLPy Reinforcement Learning Framework☆251Updated 5 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆400Updated 7 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆264Updated 6 years ago
- A list of deep neural network architectures for reinforcement learning tasks.☆167Updated 8 years ago
- Testbed for deep reinforcement learning☆160Updated 7 years ago
- Neural Network Evolution Playground with Backprop NEAT☆135Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 8 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆342Updated 6 years ago
- The original implementation of HyperNEAT in C++☆28Updated 8 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- ☆28Updated 5 years ago
- NPI(Neural Programmer-Interpreters) implementation with Keras☆244Updated 2 years ago
- Publicly releasable baselines for the Retro contest☆128Updated 6 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆91Updated 7 years ago