ardaegeunlu / X-armed-Bandits
Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.
☆9Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for X-armed-Bandits
- Implementation of my Bayesian Optimization algorithms☆11Updated 6 years ago
- Public repository for the work on bandit problems☆23Updated 7 months ago
- A Python 3 Bandit Visualization Package☆10Updated 7 years ago
- Online Variance Reduction☆13Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- ☆11Updated 8 years ago
- Empirical tests of various bandit algorithms.☆16Updated 9 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Randomized Linear Algebra in Python☆12Updated 7 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 4 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 months ago
- Code for doubly stochastic gradients☆25Updated 10 years ago
- A lightweight python library for bandit algorithms☆29Updated 2 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)☆34Updated 7 years ago
- Simple implementation of GP-UCB algorithm.☆49Updated 7 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- ☆26Updated 5 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- ☆69Updated 6 years ago
- Variational Auto-Regressive Gaussian Processes for Continual Learning☆20Updated 3 years ago
- scripts for evaluation of contextual bandit algorithms☆43Updated 4 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 5 years ago
- Code to related to my NIPS 2016 paper☆10Updated 7 years ago
- Code for "Learning Inductive Biases with Simple Neural Networks" (Feinman & Lake, 2018).☆21Updated 5 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆39Updated 6 years ago
- This is code associated with the paper: Broderick, T, Boyd, N, Wibisono, A, Wilson, AC, and Jordan, MI. Streaming variational Bayes. Neur…☆41Updated 10 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- An implementation of the Hogwild! algorithm for asynchronous SGD that interfaces with sci-kit learn.☆20Updated 4 years ago
- Pytorch-based python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]☆13Updated 4 years ago