tor / libbanditLinks
Library for Multi-Armed Bandit Algorithms
☆58Updated 8 years ago
Alternatives and similar repositories for libbandit
Users that are interested in libbandit are comparing it to the libraries listed below
Sorting:
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms☆34Updated 9 years ago
- Repo for a paper about constructing priors on very deep models.☆73Updated 9 years ago
- An extension to Sacred for automated hyperparameter optimization.☆59Updated 7 years ago
- Experimentation for oracle based contextual bandit algorithms.☆32Updated 2 years ago
- RNNprop☆36Updated 8 years ago
- Fastidious accounting of entropy streams into and out of optimization and sampling algorithms.☆33Updated 9 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 10 years ago
- Skip Context Tree Switching - Reference Implementation☆51Updated 7 years ago
- This is code associated with the paper: Broderick, T, Boyd, N, Wibisono, A, Wilson, AC, and Jordan, MI. Streaming variational Bayes. Neur…☆41Updated 10 years ago
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- Collaborative filtering with the GP-LVM☆25Updated 10 years ago
- ☆69Updated 7 years ago
- Reading Group on Reinforcement Learning topics☆56Updated 8 years ago
- Multi-armed bandit simulation library☆139Updated last year
- Deep exponential families (DEFs)☆55Updated 7 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆50Updated 6 years ago
- Topics on theoretical, mathematical aspects of DL☆72Updated 8 years ago
- C++ code for the RDIS algorithm from "Recursive Decomposition for Nonconvex Optimization." Friesen and Domingos, IJCAI 2015.☆57Updated 9 years ago
- LaTeX package for randomizing author order based on a public seed.☆40Updated 10 years ago
- Columbia Advanced Machine Learning Seminar☆24Updated 6 years ago
- Off the convex path☆67Updated 2 years ago
- ☆90Updated 7 years ago
- Python package for modular Bayesian optimization☆136Updated 4 years ago
- ☆25Updated 7 years ago
- Reducing Reparameterization Gradient Variance code.☆33Updated 8 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 8 years ago
- Edward content including papers, posters, and talks☆92Updated 4 years ago
- Python implementation of Markov Jump Hamiltonian Monte Carlo☆24Updated 8 years ago
- Implementation of Hamiltonian Monte Carlo using Google's TensorFlow☆47Updated 9 years ago
- RNN with differentiable structure (number of neurons)☆22Updated 8 years ago