ArvindSoma / a3c-super-mario-pytorchLinks
Reinforcement Learning for Super Mario Bros using A3C on GPU
☆37Updated 7 years ago
Alternatives and similar repositories for a3c-super-mario-pytorch
Users that are interested in a3c-super-mario-pytorch are comparing it to the libraries listed below
Sorting:
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- ☆69Updated 6 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
- Noisy Networks for Exploration☆186Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- Learning to play supermario using A3C algorithm☆11Updated 6 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆136Updated last year
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆202Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 6 years ago
- ☆117Updated 4 years ago
- OpenAI Retro Contest☆65Updated 2 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated 2 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Updated 6 years ago
- Yet another prioritized experience replay buffer implementation.☆48Updated 2 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Updated 7 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- General implementation of Advantage Actor Critic using Pytorch☆27Updated 3 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Updated 5 years ago