mrahtz / tensorflow-rl-pong

Pong AI trained using policy gradient-based reinforcement learning

☆51

Alternatives and similar repositories for tensorflow-rl-pong

Users who are interested in tensorflow-rl-pong are comparing it to the repositories listed below

keon / policy-gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
☆160 Updated 5 years ago
tokb23 / dqn
DQN implementation in Keras + TensorFlow + OpenAI Gym
☆158 Updated 7 years ago
flyyufelix / VizDoom-Keras-RL
Reinforcement Learning in Keras on VizDoom
☆143 Updated 7 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆360 Updated 5 years ago
yrlu / reinforcement_learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
☆151 Updated 2 years ago
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126 Updated 7 years ago
pemami4911 / deep-rl
Collection of Deep Reinforcement Learning algorithms
☆299 Updated 6 years ago
gabrielgarza / openai-gym-policy-gradient
Reinforcement Learning using Policy Gradient to solve OpenAI Gym games
☆113 Updated 7 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181 Updated 6 years ago
floodsung / DQN-Atari-Tensorflow
Simplest Version of playing Atari with Deep Q Learning in Tensorflow
☆158 Updated 7 years ago
jaromiru / AI-blog
Accompanying repository for Let's make a DQN / A3C series.
☆393 Updated 6 years ago
liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64 Updated 8 years ago
Damcy / prioritized-experience-replay
Implementation of prioritized experience replay
☆159 Updated 6 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278 Updated 5 years ago
DanielTakeshi / rl_algorithms
I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…
☆51 Updated 5 years ago
Anjum48 / rl-examples
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103 Updated 4 years ago
rgilman33 / simple-A2C-PPO
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
☆102 Updated 5 years ago
andreimuntean / A3C
Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.
☆66 Updated 7 years ago
spiglerg / DQN_DDQN_Dueling_and_DDPG_Tensorflow
Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…
☆78 Updated 8 years ago
ml-jku / baselines-rudder
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
☆267 Updated 5 years ago
andrewliao11 / gail-tf
Tensorflow implementation of generative adversarial imitation learning
☆199 Updated 7 years ago
liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48 Updated 7 years ago
greydanus / baby-a3c
A high-performance Atari A3C agent in 180 lines of PyTorch
☆171 Updated 3 years ago
brendanator / atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
☆137 Updated last year
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆107 Updated 6 years ago
openai / atari-reset
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆203 Updated 6 years ago
stevenpjg / ddpg-aigym
Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…
☆274 Updated 7 years ago
alirezamika / bipedal-es
AI learning to walk in gym's BipedalWalker environment.
☆66 Updated 8 years ago
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆427 Updated last year
Kaixhin / ACER
Actor-critic with experience replay
☆254 Updated 2 years ago