karpathy / tf-agent
TensorFlow reinforcement learning agents for OpenAI Gym environments
☆113Updated 7 years ago
Alternatives and similar repositories for tf-agent:
Users that are interested in tf-agent are comparing it to the libraries listed below
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 9 years ago
- ☆100Updated 8 years ago
- An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow☆74Updated 8 years ago
- Universal library for deep reinforcement learning.☆38Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- ☆28Updated 5 years ago
- ☆24Updated 9 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"☆56Updated 7 years ago
- An implementation of the RL-NTM from http://arxiv.org/abs/1505.00521☆157Updated 9 years ago
- Testbed for deep reinforcement learning☆160Updated 7 years ago
- A TensorFlow-based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆67Updated 8 years ago
- A list of deep neural network architectures for reinforcement learning tasks.☆166Updated 8 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent" (https://arxiv.org/abs/1606.04474)☆84Updated 7 years ago
- ☆89Updated 8 years ago
- Reinforcement learning environments for Torch7☆92Updated 8 years ago
- ☆117Updated 4 years ago
- Adversarial networks in TensorFlow☆170Updated 8 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆126Updated 4 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Benchmark testbed for assessing the performance of optimisation algorithms☆80Updated 10 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Framework and model code for the paper "Language Understanding for Text-based Games using Deep Reinforcement Learning", EMNLP 2015☆127Updated 8 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Updated 8 years ago
- NPI (Neural Programmer-Interpreters) implementation with Keras☆243Updated 2 years ago
- ☆97Updated 8 years ago
- TensorFlow implementation of "The Predictron: End-To-End Learning and Planning"☆290Updated 8 years ago
- Implementation of the Model-Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460☆55Updated 8 years ago