iurteaga / banditsLinks
Public repository for the work on bandit problems
☆23Updated last year
Alternatives and similar repositories for bandits
Users that are interested in bandits are comparing it to the libraries listed below
Sorting:
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Library for Multi-Armed Bandit Algorithms☆58Updated 8 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 8 years ago
- ☆30Updated 5 years ago
- Library for Bayesian Neural Networks in PyTorch (first version as published in ProbProg2020)☆42Updated 3 years ago
- Python implementation of projection losses.☆27Updated 5 years ago
- InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy☆148Updated 11 months ago
- CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.☆77Updated last year
- Variational Fourier Features☆86Updated 4 years ago
- Library for learning and inference with Sum-product Networks utilizing TensorFlow 2.x and Keras☆48Updated 4 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆41Updated 5 years ago
- Summaries and minimal implementations of ML / statistics research articles.☆39Updated 4 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Repository of models in Pyro☆29Updated 11 months ago
- ☆53Updated 4 years ago
- Contextual bandit benchmarking☆50Updated last month
- Estimators to perform off-policy evaluation☆13Updated 10 months ago
- Movies Recommendation with Hierarchical Poisson Factorization in Edward☆18Updated 8 years ago
- Generative Forests in Python☆35Updated 2 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆60Updated 5 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- Open access book on variational Bayesian methods written collaboratively☆28Updated 10 years ago
- Reducing Reparameterization Gradient Variance code.☆33Updated 8 years ago
- Matlab code implementing Minimum Probability Flow Learning.☆69Updated 10 years ago
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update☆75Updated 8 months ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 10 years ago
- Reference implementation of variational sequential Monte Carlo proposed by Naesseth et al. "Variational Sequential Monte Carlo" (2018)☆65Updated 6 years ago
- Reweighted Expectation Maximization☆29Updated 6 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆19Updated 7 years ago
- Implementation of linear CorEx and temporal CorEx.☆37Updated 3 years ago