Thompson Sampling for Bandits using UCB policy
☆10 Jul 29, 2017 Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users that are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below
- Implementations of basic concepts dealt with under the Reinforcement Learning umbrella. This project is a collection of assignments in CS747: F… ☆17 May 21, 2018 Updated 7 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆20 Apr 3, 2018 Updated 7 years ago
- Implement different variants of gradient descent in python using numpy ☆11 Apr 23, 2019 Updated 6 years ago
- My personal collection for some Machine Learning Lecture/ Tutorial Notes ☆11 Mar 16, 2016 Updated 10 years ago
- Visualizing ImageNet Classes Hierarchical Structure. ☆15 Apr 8, 2018 Updated 7 years ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA ☆13 Nov 16, 2025 Updated 4 months ago
- ☆13 Apr 2, 2018 Updated 7 years ago
- multi-pages dash app ☆12 Apr 3, 2018 Updated 7 years ago
- Analyzes and adjusts the volume of MP3 files ☆12 Apr 7, 2019 Updated 6 years ago
- Bootstrap (Linear) Thompson Sampling ☆13 Jun 30, 2016 Updated 9 years ago
- Real-time log based alerting for developers. ☆13 Aug 28, 2023 Updated 2 years ago
- Stochastic Variance Reduction Policy Gradient Estimation ☆11 Nov 6, 2018 Updated 7 years ago
- Kaggle Avito Demand Challenge (top 1% solution) ☆17 Jul 31, 2018 Updated 7 years ago
- ☆12 Jul 3, 2021 Updated 4 years ago
- ☆23 Feb 3, 2026 Updated last month
- ☆16 Feb 19, 2025 Updated last year
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security. ☆14 Jul 28, 2019 Updated 6 years ago
- Heuristic Dynamic Programming with Python ☆14 Jul 28, 2014 Updated 11 years ago
- An attempt to formalize my thoughts. A pythonic approach to mental housekeeping ☆15 Apr 21, 2016 Updated 9 years ago
- Code to study the generalisability of benchmark models on non-stationary EHRs. ☆14 Aug 7, 2019 Updated 6 years ago
- Hybrid Linear UCB Multi-arm Bandit library ☆14 Oct 5, 2016 Updated 9 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data ☆12 May 19, 2019 Updated 6 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d… ☆23 Dec 12, 2021 Updated 4 years ago
- Works for Applied Deep Learning / Machine Learning and Having It Deep and Structured (2017 FALL) @ NTU ☆11 Aug 14, 2018 Updated 7 years ago
- ☆19 Jun 10, 2022 Updated 3 years ago
- A neural-network-based Chinese word segmenter ☆17 Mar 29, 2019 Updated 6 years ago
- Stochastic Gradient Markov Chain Monte Carlo and Optimisation ☆17 Mar 21, 2017 Updated 9 years ago
- weighted deepwalk implementation in c++ ☆18 Feb 8, 2017 Updated 9 years ago
- Clustering documents based on LSH ☆14 Apr 20, 2016 Updated 9 years ago
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe ☆16 Nov 4, 2017 Updated 8 years ago
- Online Variance Reduction ☆15 May 9, 2019 Updated 6 years ago
- online learning for time series prediction ☆13 May 17, 2014 Updated 11 years ago
- ☆17 May 16, 2018 Updated 7 years ago
- Implementation of my Bayesian Optimization algorithms ☆12 Mar 17, 2018 Updated 8 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome… ☆15 May 17, 2017 Updated 8 years ago
- Codes for Stackelberg GAN ☆15 Apr 23, 2019 Updated 6 years ago
- Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes ☆12 Feb 12, 2019 Updated 7 years ago
- A site comparing services of different Cloud Vendors ☆10 Jan 4, 2017 Updated 9 years ago
- Everything about Transfer Learning and Domain Adaptation ☆10 Jun 5, 2019 Updated 6 years ago