tor / libbandit
Library for Multi-Armed Bandit Algorithms
☆56 Updated 8 years ago
Alternatives and similar repositories for libbandit
Users that are interested in libbandit are comparing it to the libraries listed below
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms ☆34 Updated 9 years ago
- A Python library for reinforcement learning using Bayesian approaches ☆54 Updated 10 years ago
- NeurIPS workshop on Advances in Approximate Bayesian Inference ☆48 Updated 6 months ago
- This is code associated with the paper: Broderick, T, Boyd, N, Wibisono, A, Wilson, AC, and Jordan, MI. Streaming variational Bayes. Neur… ☆41 Updated 11 years ago
- Public repository for the work on bandit problems ☆23 Updated last year
- Fastidious accounting of entropy streams into and out of optimization and sampling algorithms. ☆33 Updated 9 years ago
- Multi-armed bandit simulation library ☆140 Updated last year
- Repo for a paper about constructing priors on very deep models. ☆73 Updated 9 years ago
- Reading Group on Reinforcement Learning topics ☆56 Updated 8 years ago
- Edward content including papers, posters, and talks ☆92 Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆34 Updated 8 years ago
- Experimentation for oracle based contextual bandit algorithms. ☆33 Updated 3 years ago
- An extension to Sacred for automated hyperparameter optimization. ☆59 Updated 7 years ago
- RNNprop ☆36 Updated 8 years ago
- Implementation in C and Theano of the method Probabilistic Backpropagation for scalable Bayesian inference in deep neural networks. ☆191 Updated 6 years ago
- Hyperparameter optimization with approximate gradient ☆66 Updated 4 years ago
- Off the convex path ☆67 Updated 2 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems] ☆50 Updated 6 years ago
- ☆69 Updated 7 years ago
- Summaries and minimal implementations of ML / statistics research articles. ☆39 Updated 4 years ago
- NeurIPS 2017 best paper. An interpretable linear-time kernel goodness-of-fit test. ☆67 Updated 6 years ago
- Reducing Reparameterization Gradient Variance code. ☆33 Updated 8 years ago
- Collaborative filtering with the GP-LVM ☆25 Updated 10 years ago
- Deep exponential families (DEFs) ☆55 Updated 7 years ago
- Non-stationary Off-policy Evaluation ☆13 Updated 6 years ago
- Probabilistic Programming and Statistical Inference in PyTorch ☆111 Updated 8 years ago
- Columbia Advanced Machine Learning Seminar ☆24 Updated 7 years ago
- reinforcement learning. policy gradient. PCL ☆37 Updated 8 years ago
- Skip Context Tree Switching - Reference Implementation ☆51 Updated 8 years ago
- Gaussian Processes in Pytorch ☆75 Updated 5 years ago