Bandit algorithms simulations for online learning
☆89May 13, 2020Updated 5 years ago
Alternatives and similar repositories for bandit_simulations
Users that are interested in bandit_simulations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yahoo! news article recommendation system by linUCB☆112Feb 1, 2018Updated 8 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆57Aug 9, 2020Updated 5 years ago
- Code for my book on Multi-Armed Bandit Algorithms☆921Jan 9, 2020Updated 6 years ago
- demo of running rl-based recommender systems locally☆12Jun 11, 2022Updated 3 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆101Dec 14, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆38Mar 28, 2022Updated 3 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆90Dec 10, 2020Updated 5 years ago
- Vowpal Wabbit examples and tutorials☆21Jan 20, 2022Updated 4 years ago
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 6 years ago
- Implementation of various multi-armed bandits algorithms on a 10-arm testbed.☆38Jan 16, 2020Updated 6 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 7 years ago
- Implementation of Thompson Sampling in Python☆15Feb 4, 2020Updated 6 years ago
- Intelligent Document Processing with AWS AI/ML, published by Packt☆12Mar 2, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Play with the solutions to the multi-armed-bandit problem.☆417May 21, 2024Updated last year
- Multi-Armed Bandit Algorithms Library (MAB)☆135Sep 6, 2022Updated 3 years ago
- Contextual bandit in python☆112Jul 7, 2021Updated 4 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library☆280Sep 5, 2024Updated last year
- Example using Great Expectations to Validate Data in a scikit-learn Pipeline☆21Jul 23, 2020Updated 5 years ago
- Bootstrap (Linear) Thompson Sampling☆13Jun 30, 2016Updated 9 years ago
- ☆11Oct 9, 2021Updated 4 years ago
- Extra functions to be used with the miner package☆11Dec 14, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Offline evaluation of multi-armed bandit algorithms☆23Dec 1, 2020Updated 5 years ago
- Self-Control of Smartphone Application Usage☆11Jan 4, 2024Updated 2 years ago
- Non Intrusive Load Monitoring data repository and data converter for NILMTK☆11Feb 10, 2017Updated 9 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 7 years ago
- Network Flows Optimization - Shortest Path, Max Flow and Min Cost Flow Algorithms in Python☆11Sep 13, 2019Updated 6 years ago
- ☆15Jan 20, 2020Updated 6 years ago
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm☆27Feb 7, 2022Updated 4 years ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆17Jun 23, 2025Updated 9 months ago
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [IJCAI'23] Speeding Up Multi-Objective Hyperparameter Optimization by Task Similarity-Based Meta-Learning for the Tree-Structured Parzen …☆10Mar 9, 2024Updated 2 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17May 21, 2018Updated 7 years ago
- [NeurIPS 2024] "Discovery of the Hidden World with Large Language Models"☆31Dec 2, 2024Updated last year
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 5 years ago
- The repository contains my code for route optimization heuristics developed to solve the problem of routing for pick/place operations in …☆13Aug 3, 2018Updated 7 years ago
- ☆11May 6, 2025Updated 10 months ago
- Learning to Recommend using a Deep Reinforcement Agent☆23Apr 2, 2017Updated 8 years ago