5vision / uct_atariLinks

uct tree search + supervised lerning for atari games

☆12

Alternatives and similar repositories for uct_atari

Users that are interested in uct_atari are comparing it to the libraries listed below

Sorting:

floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66Updated 5 years ago
krfricke / rl-benchmark
Reinforcement learning benchmarking.
☆40Updated 6 years ago
Scitator / Run-Skeleton-Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
☆84Updated 5 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
flowersteam / geppg
☆35Updated 6 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
AdeelMufti / WorldModels
Full World Models Implementation in Chainer
☆166Updated 7 years ago
ehknight / natural-gradient-deep-q-learning
☆22Updated 6 years ago
RonanFR / UCRL
☆27Updated 6 years ago
openai / baselines-results
☆117Updated 5 years ago
ppaquette / gym-doom
Gym - Doom environments based on VizDoom.
☆103Updated 8 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166Updated 7 years ago
rll / deeprlhw2
☆24Updated 9 years ago
siemens / policy_search_bb-alpha
☆69Updated 7 years ago
wojzaremba / trpo
☆101Updated 8 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆92Updated 3 years ago
ppaquette / gym-pull
Add-on for OpenAI Gym that supports automatic downloading of user environments.
☆45Updated 8 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 8 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago
zuoxingdong / VIN_TensorFlow
TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular
☆52Updated 8 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
openai / sonic-on-ray
Training Sonic with RLlib
☆59Updated 2 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Updated 7 years ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Updated 6 years ago