ArvindSoma / a3c-super-mario-pytorch
Reinforcement Learning for Super Mario Bros using A3C on GPU
☆37Updated 6 years ago
Alternatives and similar repositories for a3c-super-mario-pytorch:
Users that are interested in a3c-super-mario-pytorch are comparing it to the libraries listed below
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- ☆69Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 6 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 11 months ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 7 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Learning to play supermario using A3C algorithm☆11Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 7 years ago
- PyTorch RL for Pommerman☆38Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆45Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Updated 5 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 6 years ago
- Tensorflow implementation of A3C algorithm☆47Updated 7 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 6 months ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- ☆56Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago