annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users who are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below.
- Thompson Sampling Tutorial☆55Updated 6 years ago
- Play with the solutions to the multi-armed-bandit problem.☆415Updated last year
- Contextual bandit in python☆112Updated 4 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Updated 7 years ago
- paper list in the area of reinforcement learning for recommendation systems☆25Updated 5 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 8 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆89Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆152Updated 2 years ago
- Library of contextual bandits algorithms☆336Updated last year
- ☆27Updated 6 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising☆185Updated 4 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆92Updated 4 years ago
- Upper Confidence Tree Planner for ATARI games☆19Updated 9 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Updated 6 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆99Updated 3 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆51Updated 6 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 7 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Updated 10 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 6 years ago
- Industrial Benchmark☆139Updated 2 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Updated 8 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆79Updated 8 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆117Updated 2 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 6 years ago
- pytorch neural combinatorial optimization☆388Updated 7 years ago
- ☆368Updated 5 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated last year
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆90Updated 7 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Updated 7 years ago