Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
☆195Sep 19, 2024Updated last year
Alternatives and similar repositories for ReinforcementLearning-AtariGame
Users that are interested in ReinforcementLearning-AtariGame are comparing it to the libraries listed below
Sorting:
- A3C LSTM Atari with Pytorch plus A3G design☆570Apr 18, 2023Updated 2 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Jul 4, 2018Updated 7 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆260Oct 11, 2024Updated last year
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆93Jun 21, 2017Updated 8 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- Gym-like extensions for POMDP☆56Feb 28, 2021Updated 5 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago
- Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea…☆44Feb 27, 2018Updated 8 years ago
- ☆11Feb 20, 2020Updated 6 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Jul 19, 2018Updated 7 years ago
- a modular reinforcement learning library with JAX agents☆27Mar 3, 2025Updated 11 months ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface☆387May 20, 2023Updated 2 years ago
- Python Wrappers for Post-Quantum Cryptography, see: https://csrc.nist.gov/Projects/Post-Quantum-Cryptography/Round-1-Submissions☆10Jun 24, 2018Updated 7 years ago
- Use tensorflow2 achieve PPO to play atari game☆13Oct 25, 2019Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Jul 30, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- A simple Gridworld environment for Open AI gym☆25Jun 10, 2018Updated 7 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,316Sep 25, 2019Updated 6 years ago
- 学习DRL CNN -> DQN -> LSTM☆13Oct 7, 2018Updated 7 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated last year
- implement of prioritized experience replay☆159Aug 20, 2018Updated 7 years ago
- ☆15Jul 23, 2018Updated 7 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆92Feb 8, 2021Updated 5 years ago
- Hierarchical Online Planning and Reinforcement Learning on Taxi☆32Oct 23, 2017Updated 8 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch☆173Jul 31, 2021Updated 4 years ago
- Creating DRL infrastructure for Dynamic Beta with Zipline and Keras☆14Dec 8, 2022Updated 3 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Oct 10, 2018Updated 7 years ago
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- Project 1 of Udacity's Deep Reinforcement Learning nanodegree program☆13Dec 2, 2018Updated 7 years ago
- ☆18May 8, 2018Updated 7 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- Implementation of Meta-RL A3C algorithm☆407Feb 22, 2017Updated 9 years ago
- Tutorials that integrate the Fatiando a Terra software to solve data problems in geoscience☆18Apr 18, 2024Updated last year