Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
☆201Jun 3, 2017Updated 8 years ago
Alternatives and similar repositories for paac
Users who are interested in paac compare it with the libraries listed below
- Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/ab…☆20Jan 25, 2018Updated 8 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆662Feb 25, 2020Updated 6 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆408Feb 25, 2017Updated 9 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- ☆20Apr 27, 2016Updated 9 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- ☆30Apr 15, 2017Updated 8 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆66May 22, 2017Updated 8 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom☆278Feb 20, 2018Updated 8 years ago
- Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"☆291Jan 20, 2017Updated 9 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆151Sep 2, 2024Updated last year
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Oct 28, 2016Updated 9 years ago
- Implementation of Meta-RL A3C algorithm☆407Feb 22, 2017Updated 9 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆263Feb 8, 2018Updated 8 years ago
- Code for Attentive Recurrent Comparators☆58Mar 3, 2017Updated 9 years ago
- Actor-critic with experience replay☆257Oct 9, 2022Updated 3 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Dec 23, 2016Updated 9 years ago
- Forward-mode Automatic Differentiation for TensorFlow☆139Mar 12, 2018Updated 7 years ago
- tensorflow reinforcement learning agents for OpenAI gym environments☆139Jul 21, 2017Updated 8 years ago
- an implementation of reinforcement learning problem, stock prices☆10Dec 26, 2016Updated 9 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper☆552Mar 7, 2019Updated 6 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- ☆98Aug 25, 2016Updated 9 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- Implementation of the Model Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460☆55Jul 25, 2016Updated 9 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- ☆38Mar 6, 2017Updated 8 years ago
- 🏃 Implementation of Using Fast Weights to Attend to the Recent Past.☆270Feb 20, 2019Updated 7 years ago
- Reinforcement learning environments for Torch7☆91Dec 15, 2016Updated 9 years ago
- Evolution Strategies in PyTorch☆354Sep 11, 2017Updated 8 years ago
- some RL algorithms☆19Dec 9, 2016Updated 9 years ago
- ☆64Jun 2, 2017Updated 8 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,316Sep 25, 2019Updated 6 years ago
- neon implementation of SegNet☆13Jan 3, 2023Updated 3 years ago
- Unsupervised Perceptual Rewards for Imitation Learning☆11Feb 3, 2018Updated 8 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆570Apr 18, 2023Updated 2 years ago