annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10 Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users who are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below.
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22 Updated 5 years ago
- Contextual bandit in python ☆114 Updated 4 years ago
- Thompson Sampling Tutorial ☆54 Updated 6 years ago
- Library of contextual bandits algorithms ☆334 Updated last year
- Paper list in the area of reinforcement learning for recommendation systems ☆25 Updated 5 years ago
- Implementations of basic concepts dealt with under the Reinforcement Learning umbrella. This project is a collection of assignments in CS747: F… ☆16 Updated 7 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework ☆36 Updated 7 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆96 Updated 3 years ago
- Play with the solutions to the multi-armed-bandit problem. ☆416 Updated last year
- Deconfounding Reinforcement Learning in Observational Settings ☆51 Updated 6 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆34 Updated 8 years ago
- ☆27 Updated 5 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning> ☆90 Updated 7 years ago
- Task-based end-to-end model learning in stochastic optimization ☆211 Updated 4 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020. ☆90 Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆48 Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit… ☆88 Updated 4 years ago
- ☆365 Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Upper Confidence Tree Planner for ATARI games ☆19 Updated 9 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising ☆183 Updated 4 years ago
- References at the Intersection of Causality and Reinforcement Learning ☆89 Updated 5 years ago
- Simple implementation of GP-UCB algorithm. ☆53 Updated 8 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym ☆178 Updated 7 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning ☆31 Updated 6 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 Updated 2 years ago
- Implementation of Optimal Auctions through Deep Learning ☆129 Updated 5 years ago
- More about the exploration-exploitation tradeoff with harder bandits ☆24 Updated 6 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆55 Updated 7 years ago
- Direct Gibbs sampling for DPMM using python. ☆16 Updated 8 years ago