AdamStelmaszczyk / rl-tutorial

Source code for "A deep dive into reinforcement learning"

☆13

Alternatives and similar repositories for rl-tutorial

Users that are interested in rl-tutorial are comparing it to the libraries listed below

llSourcell / pysc2
StarCraft II Learning Environment
☆18 Updated 6 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32 Updated 6 years ago
cgnicholls / rlpoker
Reinforcement learning algorithms to play Poker
☆14 Updated 3 years ago
AntonOsika / agz
AlphaGo Zero Reimplementation. MCTS Self Play library.
☆26 Updated 2 years ago
TomZahavy / CB_AE_DQN
Contextual Bandits Action Elimination DQN
☆21 Updated 7 years ago
davinwang / C2TutorialsGo
A tutorial written for Caffe2 that mimics Google's AlphaGo Fan and AlphaGo Zero.
☆8 Updated 6 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆92 Updated 3 years ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34 Updated 4 years ago
grananqvist / reinforcement-learning-super-mario-A3C
Learning to play Super Mario using the A3C algorithm
☆11 Updated 6 years ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆22 Updated 4 years ago
SuReLI / dyna-gym
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆32 Updated 6 years ago
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23 Updated 2 years ago
chenxy99 / Generative-Temporal-Models-with-Spatial-Memory
Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"
☆29 Updated 3 years ago
R-McHenry / ParallelizedGoExplore
A first bare-bones parallelized implementation of Go-Explore as described in the Uber Engineering blog post
☆46 Updated 6 years ago
yashbonde / freeciv-python
This is the learning environment for Freeciv 3.1 with Python bindings for advancements in RL. This is the first project of its kind in t…
☆40 Updated 5 years ago
alok / rl_implementations
Reinforcement learning algorithm implementations and ML experimentation workspace
☆43 Updated 6 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 Updated 6 years ago
grantsrb / PyTorch-A2C
General implementation of Advantage Actor Critic using PyTorch
☆27 Updated 3 years ago
deepsense-ai / Distributed-BA3C
☆56 Updated 2 years ago
neka-nat / async-rl-noreward
Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆8 Updated 8 years ago
p-kar / a2c-acktr-vizdoom
A2C, ACKTR and A2T implementations for ViZDoom
☆10 Updated 7 years ago
xuedong / machine-learning-summer-schools
Curated materials for different machine learning related summer schools
☆19 Updated 4 years ago
Zeta36 / Policy-chess
A Policy Network in TensorFlow to classify chess moves
☆18 Updated 8 years ago
cgel / DRL
A collection of Deep Reinforcement Learning algorithms implemented in TensorFlow. Very extensible. High-performing DQN implementation.
☆29 Updated 8 years ago
awjuliani / RL-CC
Web-based Reinforcement Learning Control Center
☆64 Updated 8 years ago
erfanMhi / base_reinforcement_learning
This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experi…
☆12 Updated 2 years ago
matthiasplappert / keras-rl-weights
Trained models for keras-rl.
☆21 Updated 8 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 Updated 5 years ago
asmadotgh / neural_chat_web
The server portion of the Neural Chat project to deploy chatbots on the web. This code is accompanied by another repository that includes the…
☆36 Updated 4 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.
☆79 Updated last year