ondrejbiza / banditsLinks

Comparison of bandit algorithms from the Reinforcement Learning bible.

☆17

Alternatives and similar repositories for bandits

Users that are interested in bandits are comparing it to the libraries listed below

Sorting:

alok / rl_implementations
Reinforcement learning algorithm implementations and ML experimentation workspace
☆43Updated 6 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
garymcintire / mpi_util
☆25Updated 7 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52Updated 4 years ago
jolibrain / manette
Deep Reinforcement Learning with Fined Grained Action Repetition
☆23Updated 7 years ago
RobRomijnders / bandit
Implementation of Counterfactual risk minimization
☆26Updated 8 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
vruvora / reinforcement-learning-kdd
☆42Updated 6 years ago
ofirnachum / models
Models built with TensorFlow
☆25Updated 6 years ago
rll / deeprlhw2
☆24Updated 9 years ago
rgilman33 / baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53Updated 5 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 7 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
siemens / policy_search_bb-alpha
☆69Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
wulfebw / async_rl
Python implementation of tabular asynchronous actor critic
☆11Updated 9 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
seba-1511 / randopt
Streamlined machine learning experiment management.
☆107Updated 5 years ago
cshenton / neuroevolution
Replication of Uber Neuroevolution paper
☆46Updated 7 years ago
ADGEfficiency / dsr-rl
Reinforcement learning course at Data Science Retreat
☆42Updated 6 years ago
awjuliani / RL-CC
Web-based Reinforcement Learning Control Center
☆64Updated 8 years ago
moskomule / pytorch.rl.learning
for learning reinforcement learning using PyTorch.
☆64Updated 5 years ago
automl / labwatch
An extension to Sacred for automated hyperparameter optimization.
☆59Updated 7 years ago
deepsense-ai / Distributed-BA3C
☆56Updated 2 years ago
AdeelMufti / WorldModels
Full World Models Implementation in Chainer
☆166Updated 7 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66Updated 5 years ago
suriyadeepan / BayesianML
Experiments in Bayesian Machine Learning
☆69Updated 6 years ago
Riashat / Bayesian-Exploration-Deep-RL
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Updated 8 years ago
SuReLI / dyna-gym
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆32Updated 6 years ago