ondrejbiza / banditsLinks
Comparison of bandit algorithms from the Reinforcement Learning bible.
☆17Updated 7 years ago
Alternatives and similar repositories for bandits
Users that are interested in bandits are comparing it to the libraries listed below
Sorting:
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- ☆43Updated 5 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- ☆68Updated 7 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated 2 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 10 years ago
- ☆44Updated 6 years ago
- some common TD Learning algorithms☆66Updated 5 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆31Updated 4 years ago
- A job launching library for docker, EC2, GCP, etc.☆57Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- ☆25Updated 7 years ago
- ☆21Updated 6 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- Collection of tutorials, exercises and papers on RL☆17Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 4 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- ☆13Updated 9 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 5 years ago