annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10 Updated 8 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm
Users interested in Bandits-using-UCB-algorithm are comparing it to the repositories listed below
- paper list in the area of reinforcement learning for recommendation systems ☆24 Updated 5 years ago
- Thompson Sampling Tutorial ☆54 Updated 6 years ago
- Contextual bandit in python ☆114 Updated 4 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow ☆35 Updated 7 years ago
- Library of contextual bandits algorithms ☆334 Updated last year
- Upper Confidence Tree Planner for ATARI games ☆19 Updated 9 years ago
- ☆27 Updated 5 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22 Updated 5 years ago
- Deconfounding Reinforcement Learning in Observational Settings ☆52 Updated 6 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising ☆183 Updated 4 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym ☆177 Updated 7 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition" ☆37 Updated 6 years ago
- RainBow, Tensorflow ☆49 Updated 7 years ago
- ☆32 Updated 2 years ago
- Play with the solutions to the multi-armed-bandit problem. ☆416 Updated last year
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆96 Updated 3 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit… ☆88 Updated 4 years ago
- reproduce some RL or Multi-Agent models ☆35 Updated 6 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆26 Updated 5 years ago
- Trust Region Policy Optimization (TRPO) in pure TensorFlow ☆18 Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 Updated 2 years ago
- FEN Code ☆38 Updated 5 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling" ☆71 Updated 9 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 Updated 2 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆34 Updated 6 years ago
- Implementation of Optimal Auctions through Deep Learning ☆129 Updated 5 years ago
- Bandit algorithms simulations for online learning ☆88 Updated 5 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay) ☆40 Updated 5 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F… ☆16 Updated 7 years ago