Non stationary bandit for experiments with Reinforcement Learning
☆33Mar 24, 2017Updated 9 years ago
Alternatives and similar repositories for NonStationaryBandit
Users that are interested in NonStationaryBandit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python for Neuroscience - An introduction to scientific computing with Python☆11Feb 3, 2016Updated 10 years ago
- A Julia Package for providing Multi Armed Bandit Experiments☆21Jul 19, 2018Updated 7 years ago
- Course taught at the University of Bordeaux in the academic year 2017 for PhD students.☆17Feb 6, 2017Updated 9 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Jun 6, 2018Updated 7 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Semi-supervised Latent Dirichlet Allocation (LDA)☆12Dec 21, 2017Updated 8 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Reinforcement learning on gridworld with Q-learning☆10Jan 28, 2017Updated 9 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.☆11Jul 12, 2018Updated 7 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆420Apr 30, 2024Updated last year
- A multi-armed bandit library for Python☆81Jan 13, 2020Updated 6 years ago
- Implementation of my Bayesian Optimization algorithms☆12Mar 17, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep Gaussian Process for Inverse Reinforcement Learning☆32Jul 6, 2017Updated 8 years ago
- Constrained multivariate least-squares optimizer for scipy☆23Jan 26, 2016Updated 10 years ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- Hierarchical Dirichlet Process (with Split-Merge Operations), originally by Chong Wang☆18Oct 12, 2013Updated 12 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- A reinforcement learning algorithm for the 2048 game☆20Mar 25, 2014Updated 12 years ago
- Call for Replication in ReScience☆13Oct 13, 2016Updated 9 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Simple implementation of GP-UCB algorithm.☆54Jan 20, 2017Updated 9 years ago
- Simple implementation of the CGP-UCB algorithm.☆38Nov 30, 2019Updated 6 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆15Feb 15, 2023Updated 3 years ago
- Lecture notes for the "Programming with Python" course I have taught in Spring 2015. at The University of Manchester☆22Jan 21, 2017Updated 9 years ago
- Implementations of various RL and Deep RL algorithms in TensorFlow, PyTorch and Keras.☆16Sep 18, 2024Updated last year
- Today I Woke: wake up early, change your life☆11Apr 28, 2021Updated 4 years ago
- ☆13Mar 11, 2025Updated last year
- Octicons glyph name for emacs☆24Dec 24, 2015Updated 10 years ago
- Course taught at the University of Bordeaux in the academic year 2015/16 for PhD students.☆21Mar 17, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆18Nov 24, 2019Updated 6 years ago
- L1 regularized Least Squares minimization problem solver.☆17Oct 12, 2016Updated 9 years ago
- Python code for the post "Adversarial Bandits and the Exp3 Algorithm"☆51Jun 9, 2020Updated 5 years ago
- Collection of useful, re-used routines.☆45Jul 15, 2017Updated 8 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- ☆17Nov 7, 2024Updated last year
- Comprehensive database of ratings for 11k news domains☆29Sep 14, 2023Updated 2 years ago