ArvindSoma / a3c-super-mario-pytorch
Reinforcement Learning for Super Mario Bros using A3C on GPU
☆37Updated 6 years ago
Alternatives and similar repositories for a3c-super-mario-pytorch:
Users that are interested in a3c-super-mario-pytorch are comparing it to the libraries listed below
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- ☆69Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆113Updated 7 years ago
- dqn autoplay mario bros☆21Updated 7 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆45Updated 4 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- ☆35Updated 6 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Atari-DRQN (keras ver.)☆33Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 5 years ago
- Tensorflow implementation of A3C algorithm☆47Updated 7 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 8 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 10 months ago