Thompson Sampling Tutorial
☆56Jan 25, 2019Updated 7 years ago
Alternatives and similar repositories for thompson
Users that are interested in thompson are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆366Aug 12, 2020Updated 5 years ago
- Multi-Armed Bandit Algorithms Library (MAB)☆136Apr 13, 2026Updated 3 weeks ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 7 years ago
- Cost-Sensitive Multi-Label Classification☆20Oct 29, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Active Learning using Multi Label Image Dataset☆14Feb 20, 2019Updated 7 years ago
- Study of the paper 'Neural Thompson Sampling' published in October 2020☆24Sep 27, 2022Updated 3 years ago
- Python code to perform risk-sensitive Reinforcement Learning with dynamic convex risk measures☆23Feb 21, 2024Updated 2 years ago
- Monte Carlo simulations of several different multi-armed bandit algorithms and a comparison with classical statistical A/B testing☆75Jan 28, 2020Updated 6 years ago
- pointMass pybullet RL environment for simple experiments☆23Jan 12, 2022Updated 4 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Keras implementation of `Decoupled Neural Interfaces using Synthetic Gradients`☆12Oct 19, 2018Updated 7 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 4 years ago
- A method to search for a subset of best performing items wrt black-box reward function☆15Jun 7, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Posterior with interesting shapes from actually used models☆13Feb 10, 2025Updated last year
- Examples for TensorFlow Weight Normalization☆14Apr 19, 2019Updated 7 years ago
- GibsonSim2RealChallenge @ CVPR2020☆35May 26, 2020Updated 5 years ago
- ☆13Jul 17, 2025Updated 9 months ago
- Coursera 2018/ data structures and algorithms / 6 course specialization by University of California, San Diego & National Research Unive…☆33Apr 6, 2018Updated 8 years ago
- ☆18Jun 13, 2025Updated 10 months ago
- Python package built on NAMD/OpenMM and OpenMMTools to perform binding free energy calculations using the TIES protocol.☆34Dec 10, 2023Updated 2 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17May 21, 2018Updated 7 years ago
- ☆11Oct 25, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆90Dec 10, 2020Updated 5 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Jupyter Notebook Tutorials for Creating Chemical Space Networks☆39Dec 27, 2023Updated 2 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 3 years ago
- ☆69Updated this week
- The Limited Multi-Label Projection Layer☆59Jul 25, 2024Updated last year
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations.☆15Sep 8, 2020Updated 5 years ago
- Robust and stable clustering of molecular dynamics simulation trajectories.☆19Sep 2, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of data dimensionality reduction algorithms SVD and CUR without using library functions.☆10Jul 24, 2017Updated 8 years ago
- A backend for storing MCMC draws.☆22Apr 27, 2026Updated last week
- Bayesian Multistate Bennett Acceptance Ratio Method☆16Mar 17, 2026Updated last month
- ☆10Aug 25, 2016Updated 9 years ago
- Code and datasets from the publication https://doi.org/10.1186/s13321-023-00787-9☆22Apr 21, 2024Updated 2 years ago
- Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)☆40Oct 13, 2020Updated 5 years ago
- Code for "Approaching Deep Learning through the Spectral Dynamics of Weights"☆13Oct 30, 2024Updated last year