Library for Multi-Armed Bandit Algorithms
☆57Apr 2, 2017Updated 8 years ago
Alternatives and similar repositories for libbandit
Users that are interested in libbandit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-armed bandit simulation library☆140Nov 9, 2023Updated 2 years ago
- More about the exploration-exploitation tradeoff with harder bandits☆24May 12, 2019Updated 6 years ago
- Notes from Simons Institute program "Foundations of Machine Learning"☆13May 5, 2017Updated 8 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆50Mar 15, 2019Updated 7 years ago
- ☆12May 22, 2016Updated 9 years ago
- ☆15May 27, 2019Updated 6 years ago
- Simulations for Dueling Bandit Algorithms, including our Double Thompson Sampling (D-TS) algorithms☆25Sep 27, 2016Updated 9 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- A lightweight python library for bandit algorithms☆30Jul 21, 2022Updated 3 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.☆11Jul 12, 2018Updated 7 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- ☆17Oct 25, 2016Updated 9 years ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆420Apr 30, 2024Updated last year
- Code for a generative controller for the AI Gym cartpole task☆15Feb 22, 2017Updated 9 years ago
- Recurrent Neural Network language modeling toolkit☆38Jan 23, 2014Updated 12 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11May 2, 2017Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Create graphs of cumulative cases over cumulative deaths for COVID-19☆12May 3, 2020Updated 5 years ago
- Dynamic weighted sampling with replacement☆14Mar 19, 2016Updated 10 years ago
- Fast Ensembles of Sparse Trees☆38Apr 9, 2016Updated 9 years ago
- SUPBUB is a tool that, in linear time, finds out superbubbles(special graph-structures) in a directed graph.☆12Sep 19, 2019Updated 6 years ago
- EXPERIMENTAL implementation of side graph☆10Apr 16, 2015Updated 10 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Sep 1, 2022Updated 3 years ago
- starter kit for vizdoom2018-singleplayer track☆28Jul 29, 2018Updated 7 years ago
- ☆27May 17, 2019Updated 6 years ago
- Genevieve client: using GenNotes, report ClinVar for individual genomes & add consensus notes☆10Aug 2, 2016Updated 9 years ago
- Using stochastic gradient descent (SGD) with explicit and implicit updates to fit large-scale statistical models.☆16Aug 21, 2014Updated 11 years ago
- Stochastic Gradient Markov Chain Monte Carlo and Optimisation☆17Mar 21, 2017Updated 9 years ago
- Library of contextual bandits algorithms☆341Mar 14, 2024Updated 2 years ago
- Run-length compressed BWT with LZ77 sampled suffix array☆10Apr 25, 2022Updated 3 years ago
- This is the libMF source files with comments in Chinses.☆29May 25, 2014Updated 11 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- A chain of LLMs to build more complex systems.☆18Apr 4, 2023Updated 2 years ago
- ☆19Jun 10, 2024Updated last year
- ☆10Apr 4, 2023Updated 2 years ago
- lightning☆11Oct 20, 2015Updated 10 years ago
- The handbook for leading Applied AI teams☆15Mar 12, 2026Updated last week
- Torch implementation for Robust convolutional neural networks under adversarial noise☆13Mar 8, 2016Updated 10 years ago
- Theano implementation of the Neural GPU☆15Jan 5, 2016Updated 10 years ago