VowpalWabbit / reinforcement_learning
Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update
☆75 Updated 8 months ago
Alternatives and similar repositories for reinforcement_learning
Users who are interested in reinforcement_learning are comparing it to the libraries listed below.
- Progress, Notes, Summaries and a lot of Questions on Machine Learning ☆55 Updated 5 years ago
- Open-source library for reinforcement learning research. ☆54 Updated 2 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆32 Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆79 Updated last year
- Public repository for the work on bandit problems ☆23 Updated last year
- Contextual bandit benchmarking ☆50 Updated last month
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆60 Updated 5 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- SafeLife: safety benchmarks for reinforcement learning agents ☆60 Updated 4 years ago
- An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset. ☆83 Updated 3 years ago
- Deep RL Bootcamp solutions ☆35 Updated 7 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time. ☆13 Updated 2 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 Updated 6 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library ☆42 Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search" ☆40 Updated 10 months ago
- Evolution Strategy Library ☆55 Updated 5 years ago
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees" ☆28 Updated 4 years ago
- Generic reinforcement learning codebase in TensorFlow ☆95 Updated 3 years ago
- Statistical adaptive stochastic optimization methods ☆32 Updated 5 years ago
- Willump Is a Low-Latency Useful Machine learning Platform. ☆44 Updated 2 years ago
- Experiment orchestration ☆103 Updated 5 years ago
- Library for Multi-Armed Bandit Algorithms ☆58 Updated 8 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 Updated 7 years ago
- This project was moved to: https://github.com/coax-dev/coax ☆160 Updated 2 years ago
- 2019 talk at GECCO ☆68 Updated 6 years ago
- Library for learning and inference with Sum-product Networks utilizing TensorFlow 2.x and Keras ☆48 Updated 4 years ago
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020) ☆43 Updated 2 years ago
- Functional machine learning for fun ☆85 Updated 4 years ago
- ☆45 Updated 5 years ago
- RL-Bakery makes it easy to build production, large scale, batch Deep Reinforcement Learning applications. ☆92 Updated 9 months ago