A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit—designed for reinforcement learning applications
☆13 · Dec 31, 2024 · Updated last year
Alternatives and similar repositories for contextual-bandits
Users who are interested in contextual-bandits are comparing it to the libraries listed below.
- Code for the paper "Dynamic pricing algorithm for edge computing task offloading based on Contextual Multi-Armed Bandit". ☆12 · Apr 16, 2024 · Updated last year
- Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, … ☆45 · Nov 25, 2025 · Updated 4 months ago
- Collection of review materials for the Fall 2024 SJTU Dialectics of Nature course ☆26 · Feb 19, 2026 · Updated last month
- Simulation Implementations based on "A potential game approach to multiple UAV cooperative search and surveillance" ☆15 · Apr 2, 2024 · Updated last year
- ☆19 · Jul 14, 2025 · Updated 8 months ago
- Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volat… ☆21 · Feb 24, 2020 · Updated 6 years ago
- Funding arbitrage screener for Binance, OKX, ByBit, Mexc ☆14 · Sep 25, 2024 · Updated last year
- Python SDK for vishwa.ai ☆21 · Jan 29, 2024 · Updated 2 years ago
- ☆35 · Updated this week
- ☆21 · Aug 20, 2022 · Updated 3 years ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆56 · Mar 17, 2026 · Updated last week
- Code for "Context-aware Communication for Multi-agent Reinforcement Learning" ☆36 · Jan 29, 2024 · Updated 2 years ago
- ☆23 · May 3, 2025 · Updated 10 months ago
- How to create models using Gurobi in Python ☆14 · Mar 25, 2022 · Updated 4 years ago
- [NeurIPS 2024] "Collaboration! Towards Robust Neural Methods for Routing Problems" ☆21 · Nov 16, 2024 · Updated last year
- Pytorch implementation of GLocal-K: Global and Local Kernels for Recommender Systems ☆12 · Dec 5, 2022 · Updated 3 years ago
- ☆28 · Mar 20, 2021 · Updated 5 years ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op… ☆29 · Dec 11, 2025 · Updated 3 months ago
- Crawl trading markets from crypto exchanges everyday ☆19 · May 8, 2025 · Updated 10 months ago
- Use the minimum curvature method to perform the directional drilling calculations between two survey stations (Inclination / Azimuth / ☆14 · Jul 22, 2018 · Updated 7 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR… ☆13 · Mar 13, 2022 · Updated 4 years ago
- Process mining module for Python. ☆18 · Nov 16, 2021 · Updated 4 years ago
- A Python library for automated Pressure Transient Analysis (PTA) workflows. It provides tools for PTA flow regime feature extraction, tim… ☆10 · Dec 16, 2025 · Updated 3 months ago
- Reproduce the result of the paper "Deep Learning with Long Short-Term Memory Networks for Financial Market Prediction" ☆19 · Aug 21, 2020 · Updated 5 years ago
- Flutter UI Practices ☆13 · Apr 15, 2020 · Updated 5 years ago
- ☆24 · Feb 8, 2024 · Updated 2 years ago
- A modular and flexible backtesting framework for trading strategies in Python using Backtrader. ☆25 · Aug 25, 2025 · Updated 7 months ago
- Arbitrage trading robot in binance.com ☆15 · Oct 18, 2022 · Updated 3 years ago
- ☆11 · Sep 27, 2024 · Updated last year
- Python implementation of the capacitance resistance model ☆16 · May 10, 2020 · Updated 5 years ago
- MXNet Implementation of DCGAN, Conditional GAN, pix2pix ☆25 · Dec 10, 2022 · Updated 3 years ago
- An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large… ☆14 · Feb 17, 2025 · Updated last year
- Hybrid recommendation engine using deep learning that incorporates user and item features, including images and text. ☆20 · Jun 3, 2020 · Updated 5 years ago
- SFCTA Prospector: Data Warehouse and Visualization Platform ☆17 · Mar 19, 2024 · Updated 2 years ago
- Data-Driven Engineering ☆21 · Nov 6, 2024 · Updated last year
- Jupyter notebook for the MAP-Elites algorithms (Mouret & Clune, 2015) ☆23 · Jul 9, 2022 · Updated 3 years ago
- Joint Resource Allocation and Trajectory Optimization for Multi-UAV-Assisted Multi-Access Mobile Edge Computing ☆34 · Nov 20, 2023 · Updated 2 years ago
- A library to visualize C data structures. ☆15 · Apr 11, 2020 · Updated 5 years ago
- ☆10 · May 29, 2024 · Updated last year