schinger / pong_actor-criticLinks
Trains an agent with (stochastic) Policy Gradients(actor-critic) on Pong. Uses OpenAI Gym.
☆15Updated 4 months ago
Alternatives and similar repositories for pong_actor-critic
Users that are interested in pong_actor-critic are comparing it to the libraries listed below
Sorting:
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 8 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 8 years ago
- ☆29Updated 8 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- ☆79Updated 7 years ago
- ☆56Updated 6 years ago
- ☆38Updated 8 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Updated 7 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Updated 7 years ago
- DEPRECATED!!! the same as official VAE example, but using Trainer in this repo☆30Updated 7 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- Deterministic Policy Gradient using torch7☆43Updated 9 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- ☆17Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- Backpropagation training of neural networks with Hebbian plastic connections☆31Updated 3 years ago
- An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)☆34Updated 8 years ago
- Distributed A3C☆34Updated 7 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 9 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆55Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- Blog for the Open Institute for Advanced Study☆10Updated 4 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Updated 6 years ago