tor / libbanditLinks
Library for Multi-Armed Bandit Algorithms
☆56Updated 8 years ago
Alternatives and similar repositories for libbandit
Users that are interested in libbandit are comparing it to the libraries listed below
Sorting:
- Fastidious accounting of entropy streams into and out of optimization and sampling algorithms.☆33Updated 9 years ago
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms☆34Updated 9 years ago
- Collaborative filtering with the GP-LVM☆25Updated 10 years ago
- Reading Group on Reinforcement Learning topics☆56Updated 9 years ago
- A Python library for reinforcement learning using Bayesian approaches☆53Updated 10 years ago
- An extension to Sacred for automated hyperparameter optimization.☆59Updated 7 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆50Updated 6 years ago
- This is code associated with the paper: Broderick, T, Boyd, N, Wibisono, A, Wilson, AC, and Jordan, MI. Streaming variational Bayes. Neur…☆41Updated 11 years ago
- Repo for a paper about constructing priors on very deep models.☆73Updated 9 years ago
- Skip Context Tree Switching - Reference Implementation☆51Updated 8 years ago
- Implementation in C and Theano of the method Probabilistic Backpropagation for scalable Bayesian inference in deep neural networks.☆191Updated 6 years ago
- Edward content including papers, posters, and talks☆92Updated 5 years ago
- RNNprop☆36Updated 8 years ago
- Public repository for the work on bandit problems☆24Updated last year
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- Topics on theoretical, mathematical aspects of DL☆72Updated 9 years ago
- ☆67Updated 7 years ago
- Columbia Advanced Machine Learning Seminar☆24Updated 7 years ago
- Multi-armed bandit simulation library☆140Updated 2 years ago
- Optimizers for machine learning☆183Updated 2 years ago
- Efficient Hyperparameter Optimization of Deep Learning Algorithms Using Deterministic RBF Surrogates☆115Updated 8 years ago
- C++ code for the RDIS algorithm from "Recursive Decomposition for Nonconvex Optimization." Friesen and Domingos, IJCAI 2015.☆58Updated 10 years ago
- Experiment code for Stochastic Gradient Hamiltonian Monte Carlo☆110Updated 7 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO)☆51Updated 9 years ago
- Off the convex path☆67Updated 2 years ago
- NeurIPS workshop on Advances in Approximate Bayesian Inference☆48Updated 7 months ago
- Modular Probabilistic Programming on MXNet☆104Updated 2 years ago
- Reducing Reparameterization Gradient Variance code.☆33Updated 8 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆19Updated 7 years ago
- some common TD Learning algorithms☆66Updated 5 years ago