lilianweng / multi-armed-banditView external linksLinks
Play with the solutions to the multi-armed-bandit problem.
☆416May 21, 2024Updated last year
Alternatives and similar repositories for multi-armed-bandit
Users that are interested in multi-armed-bandit are comparing it to the libraries listed below
Sorting:
- Implementation of various multi-armed bandits algorithms on a 10-arm testbed.☆38Jan 16, 2020Updated 6 years ago
- Multi-Armed Bandit Algorithms Library (MAB)☆135Sep 6, 2022Updated 3 years ago
- Python implementations of contextual bandits algorithms☆820Jan 14, 2026Updated last month
- Code for my book on Multi-Armed Bandit Algorithms☆920Jan 9, 2020Updated 6 years ago
- Library of contextual bandits algorithms☆339Mar 14, 2024Updated last year
- Multi-Arm Bandits for online recommendations via Particle Thompson Sampling with Probabilistic Matrix Factorization☆14May 9, 2018Updated 7 years ago
- ☆369Aug 12, 2020Updated 5 years ago
- Python code for the post "Adversarial Bandits and the Exp3 Algorithm"☆50Jun 9, 2020Updated 5 years ago
- Implementing LinUCB and HybridLinUCB in Python.☆49May 15, 2018Updated 7 years ago
- ☆83Jan 21, 2019Updated 7 years ago
- Entity Linking within a Social Media Platform☆11May 2, 2019Updated 6 years ago
- Contextual Combinatorial Cascading Bandits☆10Jun 30, 2016Updated 9 years ago
- ☆13Nov 12, 2019Updated 6 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code☆56Dec 23, 2015Updated 10 years ago
- Bandit algorithms simulations for online learning☆88May 13, 2020Updated 5 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- R package for Multi-Armed Bandit Simulation Study☆38Aug 18, 2017Updated 8 years ago
- Simulations for Dueling Bandit Algorithms, including our Double Thompson Sampling (D-TS) algorithms☆25Sep 27, 2016Updated 9 years ago
- Yahoo! news article recommendation system by linUCB☆111Feb 1, 2018Updated 8 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆16Dec 7, 2021Updated 4 years ago
- The High-dimensional BayesOpt algorithms from "A Framework for Bayesian Optimization in Embedded Subspaces☆43Jun 8, 2019Updated 6 years ago
- ☆13Jun 23, 2017Updated 8 years ago
- Global Average Pooling Implemented in TensorFlow☆15Nov 9, 2017Updated 8 years ago
- Neural Elastic Inference and Search☆20Nov 14, 2019Updated 6 years ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 6 years ago
- A multi-armed bandit library for Python☆82Jan 13, 2020Updated 6 years ago
- Deep & Classical Reinforcement Learning + Machine Learning Examples in Python☆370Jul 20, 2023Updated 2 years ago
- Code for Fast Information-theoretic Bayesian Optimisation☆16Jun 7, 2018Updated 7 years ago
- A curated list on papers about combinatorial multi-armed bandit problems.☆17May 10, 2021Updated 4 years ago
- ☆20Mar 30, 2022Updated 3 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library☆280Sep 5, 2024Updated last year
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆205Oct 2, 2020Updated 5 years ago
- Japanese tutorial for Vespa☆20Mar 14, 2018Updated 7 years ago
- ☆70Mar 2, 2015Updated 10 years ago
- A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)☆3,682Updated this week
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆57Aug 9, 2020Updated 5 years ago
- Quiz code of debugging a badly-implemented neural network☆22Dec 19, 2018Updated 7 years ago