UesugiErii / tf2-PPO-atari

Use tensorflow2 achieve PPO to play atari game

☆12

Related projects ⓘ

Alternatives and complementary repositories for tf2-PPO-atari

mehdiboubnan / Deep-Reinforcement-Learning-applied-to-DOOM
DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM
☆83Updated 3 years ago
TonghanWang / ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
☆149Updated last year
DKuan / sc2_QMIX
The project to learn the QMIX.
☆11Updated 4 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆173Updated last year
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆101Updated 5 years ago
jw1401 / PPO-Tensorflow-2.0
Proximal Policy Optimization with Tensorflow 2.0
☆30Updated 5 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆94Updated 4 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆104Updated 6 years ago
Huixxi / TensorFlow2.0-for-Deep-Reinforcement-Learning
TensorFlow 2.0 for Deep Reinforcement Learning.
☆82Updated last year
wsjeon / maddpg-rllib
MADDPG in Ray/RLlib
☆50Updated 4 years ago
deligentfool / policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
☆36Updated 3 years ago
AdamStelmaszczyk / dqn
TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)
☆40Updated 4 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆182Updated 5 years ago
younggyoseo / pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆45Updated 5 years ago
YuhangSong / Arena-BuildingToolkit
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆81Updated 3 years ago
AnujMahajanOxf / MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
☆57Updated 2 years ago
kkweon / A3C-Tensorflow
Simple Example A3C Reinforcement Learning Algorithm in Tensorflow
☆13Updated 7 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆47Updated last month
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆253Updated 4 years ago
praveen-palanisamy / Ape-X-DQN
PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner
☆27Updated 4 years ago
ray-project / rl-experiments
Keeping track of RL experiments
☆159Updated last year
simonmeister / pysc2-rl-agents
StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)
☆134Updated 6 years ago
LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Updated 5 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆70Updated 7 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆61Updated 6 years ago
shakenes / vizdoomgym
OpenAI Gym wrapper for ViZDoom enviroments
☆66Updated 3 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆161Updated 3 months ago
BY571 / QR-DQN
PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression
☆26Updated 4 years ago