annieyan / Bandits-using-UCB-algorithmLinks
Thompson Sampling for Bandits using UCB policy
☆10Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users that are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below
Sorting:
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Updated 6 years ago
- Contextual bandit in python☆113Updated 4 years ago
- Library of contextual bandits algorithms☆336Updated last year
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆99Updated 3 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆51Updated 6 years ago
- Play with the solutions to the multi-armed-bandit problem.☆414Updated last year
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 8 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Updated 7 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising☆183Updated 4 years ago
- Thompson Sampling Tutorial☆55Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆89Updated 4 years ago
- Direct Gibbs sampling for DPMM using python.☆17Updated 8 years ago
- Bandit algorithms simulations for online learning☆88Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 6 years ago
- ☆366Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Updated 8 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆31Updated 6 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated 2 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆25Updated 5 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆92Updated 4 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆90Updated 7 years ago
- Simple implementation of GP-UCB algorithm.☆54Updated 8 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 8 years ago
- Multi-Armed Bandit Algorithms Library (MAB)☆133Updated 3 years ago
- ☆44Updated 5 years ago
- Reinforcement learning in python☆36Updated 6 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 7 years ago
- ☆27Updated 6 years ago
- More about the exploration-exploitation tradeoff with harder bandits☆24Updated 6 years ago