annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10 Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users who are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below.
- Play with the solutions to the multi-armed-bandit problem. ☆415 Updated last year
- Deconfounding Reinforcement Learning in Observational Settings ☆51 Updated 6 years ago
- Contextual bandit in python ☆112 Updated 4 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym ☆178 Updated 7 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020. ☆92 Updated 4 years ago
- paper list in the area of reinforcement learning for recommendation systems ☆25 Updated 5 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22 Updated 6 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆33 Updated 8 years ago
- Thompson Sampling Tutorial ☆55 Updated 6 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆99 Updated 4 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆152 Updated 2 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,… ☆64 Updated 8 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F… ☆16 Updated 7 years ago
- Library of contextual bandits algorithms ☆336 Updated last year
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 6 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit… ☆89 Updated 5 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning> ☆90 Updated 8 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework ☆36 Updated 7 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising ☆187 Updated 4 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition" ☆37 Updated 6 years ago
- Reinforcement learning in python ☆36 Updated 6 years ago
- Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games. ☆69 Updated 5 months ago
- Dynamic Pricing BwK Problem and Reinforcement Learning ☆31 Updated 7 years ago
- Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition] ☆107 Updated 3 years ago
- ☆27 Updated 6 years ago
- RainBow, Tensorflow ☆49 Updated 7 years ago
- reproduce some RL or Multi-Agent models ☆35 Updated 6 years ago
- Upper Confidence Tree Planner for ATARI games ☆19 Updated 9 years ago
- pytorch neural combinatorial optimization ☆387 Updated 8 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks ☆118 Updated 2 years ago