ArvindSoma / a3c-super-mario-pytorch
Reinforcement Learning for Super Mario Bros using A3C on GPU
☆36 Updated 6 years ago
Related projects
Alternatives and complementary repositories for a3c-super-mario-pytorch
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆77 Updated 5 years ago
- Ape-X DQN & DDPG with PyTorch & TensorBoard ☆101 Updated 5 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow ☆135 Updated 7 months ago
- Meta Reinforcement Learning Experiments ☆33 Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 Updated 5 years ago
- Simple grid-world environment compatible with OpenAI Gym ☆49 Updated 4 years ago
- Implementation of selected reinforcement learning algorithms in TensorFlow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 Updated last year
- Deep Q Learning via PyTorch ☆86 Updated 6 years ago
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co… ☆131 Updated last year
- ☆69 Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆46 Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆153 Updated 7 years ago
- World Models applied to the OpenAI Sonic Retro Contest ☆77 Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated 2 months ago
- This is the PyTorch implementation of the ICML 2018 paper Self-Imitation Learning. ☆65 Updated 6 years ago
- Implementation of prioritized experience replay ☆156 Updated 6 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆180 Updated 6 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input. ☆30 Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras ☆32 Updated 8 years ago
- NIPS 2017 Value Prediction Network ☆166 Updated 6 years ago
- Rainbow, TensorFlow ☆49 Updated 6 years ago
- Yet another prioritized experience replay buffer implementation. ☆48 Updated 2 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆30 Updated 6 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 5 years ago
- TensorFlow implementation of "Noisy network for exploration" ☆33 Updated 7 years ago
- PyTorch implementation of Proximal Policy Optimization ☆50 Updated 6 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆170 Updated 3 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole ☆64 Updated 7 years ago
- Noisy Networks for Exploration ☆185 Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning ☆114 Updated 4 years ago