annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10Updated 7 years ago
Alternatives and similar repositories for Bandits-using-UCB-algorithm:
Users that are interested in Bandits-using-UCB-algorithm are comparing it to the libraries listed below
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Updated 5 years ago
- ☆26Updated 5 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- Thompson Sampling Tutorial☆51Updated 6 years ago
- The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.☆85Updated 3 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- ☆16Updated 6 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆24Updated 4 years ago
- Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes☆23Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆30Updated 4 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆49Updated 5 years ago
- Contextual bandit in python☆110Updated 3 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17Updated 6 years ago
- Direct Gibbs sampling for DPMM using python.☆16Updated 7 years ago
- Multi-Objective Deep Reinforcement Learning☆43Updated 8 years ago
- Contains Code for Contextual Bandits Decision Tree☆20Updated 5 years ago
- Task-based end-to-end model learning in stochastic optimization☆202Updated 4 years ago
- ☆32Updated 2 years ago
- working example of a contextual multi-armed bandit☆55Updated 5 years ago
- ☆8Updated 7 years ago
- Simple implementation of GP-UCB algorithm.☆51Updated 8 years ago
- Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.☆69Updated 8 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Updated 5 years ago
- Code to train RL agents along with Adversarial distrubance agents☆63Updated 7 years ago
- FEN Code☆37Updated 5 years ago
- Experiments on a discrete mean field game model of population dynamics with reinforcement learning☆34Updated last year
- Asymmetric Transfer Learning with Deep Gaussian Processes☆18Updated 9 years ago