Thompson Sampling Tutorial
☆56Jan 25, 2019Updated 7 years ago
Alternatives and similar repositories for thompson
Users that are interested in thompson are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bootstrap (Linear) Thompson Sampling☆13Jun 30, 2016Updated 9 years ago
- ☆370Aug 12, 2020Updated 5 years ago
- Cost-Sensitive Multi-Label Classification☆20Oct 29, 2017Updated 8 years ago
- Active Learning using Multi Label Image Dataset☆14Feb 20, 2019Updated 7 years ago
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'☆13May 24, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python code to perform risk-sensitive Reinforcement Learning with dynamic convex risk measures☆23Feb 21, 2024Updated 2 years ago
- pointMass pybullet RL environment for simple experiments☆23Jan 12, 2022Updated 4 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Java implementation of Thompson sampling to solve the multi-armed bandit problem☆30Jun 14, 2023Updated 2 years ago
- ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation☆10Apr 18, 2019Updated 6 years ago
- A simple Python script to get details of top 1000 best matching results for any search query on GitHub☆12Jun 2, 2018Updated 7 years ago
- Keras implementation of `Decoupled Neural Interfaces using Synthetic Gradients`☆12Oct 19, 2018Updated 7 years ago
- A pokemon battle AI based on UCT-MCTS☆13May 5, 2022Updated 3 years ago
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- A method to search for a subset of best performing items wrt black-box reward function☆15Jun 7, 2019Updated 6 years ago
- Posterior with interesting shapes from actually used models☆13Feb 10, 2025Updated last year
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Mar 3, 2017Updated 9 years ago
- Code for "Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models (CoNLL 2018)"☆15Feb 6, 2019Updated 7 years ago
- SIMPLE: A Gradient Estimator for $k$-subset Sampling☆12Aug 8, 2024Updated last year
- ELEC6910R & COMP6211C: Robotic Perception of HKUST, public shared files, including tutorials, project topics(in the future)☆11Sep 12, 2020Updated 5 years ago
- ☆13Jul 17, 2025Updated 8 months ago
- Examples and data for performing path similarity analysis (PSA).☆17Oct 23, 2015Updated 10 years ago
- Coursera 2018/ data structures and algorithms / 6 course specialization by University of California, San Diego & National Research Unive…☆33Apr 6, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Jun 13, 2025Updated 9 months ago
- ☆12Jul 15, 2020Updated 5 years ago
- ☆54Updated this week
- Python package built on NAMD/OpenMM and OpenMMTools to perform binding free energy calculations using the TIES protocol.☆34Dec 10, 2023Updated 2 years ago
- ☆11Oct 25, 2023Updated 2 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 3 years ago
- ☆12Jun 13, 2022Updated 3 years ago
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations.☆15Sep 8, 2020Updated 5 years ago
- Bayesian Multistate Bennett Acceptance Ratio Method☆16Mar 17, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Aug 25, 2016Updated 9 years ago
- ☆31Jul 14, 2021Updated 4 years ago
- A PyTorch implementation of deep Q-learning for Atari games☆13Dec 4, 2018Updated 7 years ago
- some examples of bert☆14Nov 29, 2018Updated 7 years ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆16Oct 6, 2024Updated last year
- Implementation of Embarrassingly Shallow Autoencoders (Harald Steck) in PyTorch☆35Sep 23, 2023Updated 2 years ago
- Simulation code for reference with MABUC article: Bareinboim, Forney, & Pearl (2015)☆18Nov 11, 2015Updated 10 years ago