ondrejbiza / banditsLinks
Comparison of bandit algorithms from the Reinforcement Learning bible.
☆17Updated 7 years ago
Alternatives and similar repositories for bandits
Users that are interested in bandits are comparing it to the libraries listed below
Sorting:
- Reinforcement learning algorithm implementations and ML experimentation workspace☆43Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- ☆25Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 4 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- ☆42Updated 6 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- ☆24Updated 9 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- ☆69Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- Streamlined machine learning experiment management.☆107Updated 5 years ago
- Replication of Uber Neuroevolution paper☆46Updated 7 years ago
- Reinforcement learning course at Data Science Retreat☆42Updated 6 years ago
- Web-based Reinforcement Learning Control Center☆64Updated 8 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 5 years ago
- An extension to Sacred for automated hyperparameter optimization.☆59Updated 7 years ago
- ☆56Updated 2 years ago
- Full World Models Implementation in Chainer☆166Updated 7 years ago
- some common TD Learning algorithms☆66Updated 5 years ago
- Experiments in Bayesian Machine Learning☆69Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 8 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 6 years ago