tor / libbandit
Library for Multi-Armed Bandit Algorithms
☆57Updated 7 years ago
Alternatives and similar repositories for libbandit:
Users that are interested in libbandit are comparing it to the libraries listed below
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms☆33Updated 8 years ago
- Collaborative filtering with the GP-LVM☆25Updated 9 years ago
- Experimentation for oracle based contextual bandit algorithms.☆31Updated 2 years ago
- NeurIPS workshop on Advances in Approximate Bayesian Inference☆48Updated last week
- ☆68Updated 6 years ago
- An extension to Sacred for automated hyperparameter optimization.☆59Updated 6 years ago
- Public repository for the work on bandit problems☆23Updated 9 months ago
- A lightweight python library for bandit algorithms☆30Updated 2 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆48Updated 5 years ago
- Fastidious accounting of entropy streams into and out of optimization and sampling algorithms.☆32Updated 8 years ago
- Reducing Reparameterization Gradient Variance code.☆33Updated 7 years ago
- Repo for a paper about constructing priors on very deep models.☆72Updated 8 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆19Updated 6 years ago
- RNN with differentiable structure (number of neurons)☆22Updated 8 years ago
- Topics on theoretical, mathematical aspects of DL☆71Updated 8 years ago
- Summaries and minimal implementations of ML / statistics research articles.☆39Updated 3 years ago
- This is code associated with the paper: Broderick, T, Boyd, N, Wibisono, A, Wilson, AC, and Jordan, MI. Streaming variational Bayes. Neur…☆41Updated 10 years ago
- Empirical tests of various bandit algorithms.☆16Updated 10 years ago
- Multi-armed bandit simulation library☆138Updated last year
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- ☆46Updated 11 years ago
- code for stochastic expectation propagation☆16Updated 9 years ago
- Reading Group on Reinforcement Learning topics☆55Updated 8 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO)☆51Updated 8 years ago
- C++ code for the RDIS algorithm from "Recursive Decomposition for Nonconvex Optimization." Friesen and Domingos, IJCAI 2015.☆57Updated 9 years ago
- Implementation of Hamiltonian Monte Carlo using Google's TensorFlow☆47Updated 9 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Updated 5 years ago
- Off the convex path☆67Updated last year
- Deep exponential families (DEFs)☆55Updated 6 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆39Updated 6 years ago