tansey / tstd0Links
An experiment with Thompson sampling and TD(0) on a grid world variant
☆17Updated 11 years ago
Alternatives and similar repositories for tstd0
Users that are interested in tstd0 are comparing it to the libraries listed below
Sorting:
- Empirical tests of various bandit algorithms.☆16Updated 10 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆49Updated 6 years ago
- Collaborative filtering with the GP-LVM☆25Updated 10 years ago
- Gopalan, P., Ruiz, F. J., Ranganath, R., & Blei, D. M. (2014). Bayesian Nonparametric Poisson Factorization for Recommendation Systems. I…☆15Updated 10 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆19Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- Boosting and ensemble learning in Python.☆54Updated 10 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code☆56Updated 9 years ago
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆26Updated 9 years ago
- Experimentation for oracle based contextual bandit algorithms.☆31Updated 2 years ago
- ☆36Updated 10 years ago
- ☆20Updated 8 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Updated 10 years ago
- Public repository for the work on bandit problems☆23Updated last year
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- simple python interface to SMAC.☆21Updated 7 years ago
- A short paper describing the library is available on arXiv.☆64Updated 7 years ago
- Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016)☆43Updated 9 years ago
- A potential 22nd rank solution to Criteo Labs Display Advertising Challenge on Kaggle☆25Updated 7 years ago
- Bayesian Logistic Regression using Laplace approximations to the posterior.☆47Updated 8 years ago
- ☆11Updated 8 years ago
- Mirror of Apache Spark☆24Updated 9 years ago
- The information sieve for discrete variables.☆36Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- Gaussian Process Factorization Machines for Context-aware Recommendations☆42Updated 10 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 10 years ago
- ☆20Updated 9 years ago
- Big Topic Model is a fast engine for running large-scale Topic Models.☆22Updated 8 years ago
- ☆29Updated 7 years ago