schinger / pong_actor-criticLinks
Trains an agent with (stochastic) Policy Gradients(actor-critic) on Pong. Uses OpenAI Gym.
☆17Updated last year
Alternatives and similar repositories for pong_actor-critic
Users that are interested in pong_actor-critic are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆225Updated 8 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 8 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆151Updated last year
- Reinforcement learning models in ViZDoom environment☆130Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Updated 8 years ago
- ☆160Updated 8 years ago
- ☆98Updated 9 years ago
- Noisy Networks for Exploration☆186Updated 8 years ago
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Updated 8 years ago
- Implement A3C for Mujoco gym envs☆73Updated 8 years ago
- Deep Attention Recurrent Q-Network☆115Updated 10 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Updated 7 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 8 years ago
- Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)☆319Updated 5 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 6 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration☆203Updated 7 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Updated 8 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 8 years ago
- Benchmark and build RL architectures that can do multitask and transfer learning.☆144Updated 3 years ago
- [NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations☆183Updated last year
- NIPS 2017 Value Prediction Network☆167Updated 8 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆55Updated 9 years ago
- Population Based Training, Figure 2☆25Updated 8 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 6 years ago
- ☆119Updated 5 years ago