karpathy / tf-agent
tensorflow reinforcement learning agents for OpenAI gym environments
☆110Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for tf-agent
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Updated 7 years ago
- Adversarial networks in TensorFlow☆170Updated 8 years ago
- Universal library for deep reinforcement learning.☆39Updated 8 years ago
- ☆99Updated 8 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- ☆68Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 8 years ago
- Benchmark testbed for assessing the performance of optimisation algorithms☆80Updated 9 years ago
- Testbed for deep reinforcement learning☆160Updated 7 years ago
- KEras Reinforcement Learning gYM agents☆292Updated 7 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 8 years ago
- ☆64Updated 7 years ago
- An implementation of the RL-NTM from http://arxiv.org/abs/1505.00521☆157Updated 8 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- X is a temporary name, but here lies RL☆40Updated 7 years ago
- Reinforcement learning environments for Torch7☆93Updated 7 years ago
- ☆24Updated 9 years ago
- Basic DQN implementation☆220Updated 6 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 6 years ago
- A list of deep neural network architectures for reinforcement learning tasks.☆167Updated 8 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆172Updated 8 years ago
- Implementation of a simple example of Q learning in Torch.☆50Updated 7 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆126Updated 4 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆264Updated 6 years ago
- NPI(Neural Programmer-Interpreters) implementation with Keras☆244Updated 2 years ago
- ☆86Updated 11 years ago