cmusjtuliuyuan / RainBowLinks

RainBow, Tensorflow

☆49

Alternatives and similar repositories for RainBow

Users that are interested in RainBow are comparing it to the libraries listed below

Sorting:

cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35Updated 6 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181Updated 6 years ago
wenh123 / NoisyNet-DQN
Tensorflow Implementation for "Noisy network for exploration"
☆32Updated 8 years ago
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126Updated 7 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆177Updated 7 years ago
Damcy / prioritized-experience-replay
implement of prioritized experience replay
☆159Updated 6 years ago
sadeqa / Super-Mario-Bros-RL
This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…
☆79Updated 6 years ago
nikonikolov / rltf
Reinforcement Learning implementations and research prototyping in TensorFlow
☆82Updated 6 years ago
yrlu / reinforcement_learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
☆151Updated 2 years ago
wwxFromTju / deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…
☆127Updated 2 years ago
keon / policy-gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
☆160Updated 5 years ago
takoika / PrioritizedExperienceReplay
Yet another prioritized experience replay buffer implementation.
☆48Updated 2 years ago
spiglerg / DQN_DDQN_Dueling_and_DDPG_Tensorflow
Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…
☆78Updated 8 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
dbobrenko / awesome-rl
Awesome RL: Papers, Books, Codes, Benchmarks
☆116Updated last year
analog-rl / Duel_DDQN
Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras
☆31Updated 9 years ago
floodsung / a2c_cartpole_pytorch
advantage actor-critic reinforcement learning for openai gym cartpole
☆65Updated 8 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132Updated 6 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102Updated 6 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254Updated 2 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50Updated 5 years ago
AdamStelmaszczyk / dqn
TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)
☆40Updated 5 years ago
LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Updated 5 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47Updated 4 years ago
Officium / RL-Experiments
High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Updated 11 months ago
hengyuan-hu / rainbow
A PyTorch implementation of Rainbow DQN agent
☆168Updated 7 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
☆62Updated 4 years ago