iosband/ts_tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iosband/ts_tutorial)

iosband / ts_tutorial

☆368

Alternatives and similar repositories for ts_tutorial

Users that are interested in ts_tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

andrecianflone / thompson
View on GitHub
Thompson Sampling Tutorial
☆56Jan 25, 2019Updated 7 years ago
bgalbraith / bandits
View on GitHub
Python library for Multi-Armed Bandits
☆771Feb 11, 2020Updated 6 years ago
aa14k / Exploration-in-RL
View on GitHub
☆29May 27, 2024Updated 2 years ago
david-cortes / contextualbandits
View on GitHub
Python implementations of contextual bandits algorithms
☆838Jun 28, 2026Updated 3 weeks ago
SMPyBandits / SMPyBandits
View on GitHub
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆424Jun 19, 2026Updated last month
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
MinRegret / deluca
View on GitHub
Performant, differentiable reinforcement learning
☆23Jun 16, 2023Updated 3 years ago
alison-carrera / mabalgs
View on GitHub
Multi-Armed Bandit Algorithms Library (MAB)
☆136Apr 13, 2026Updated 3 months ago
akshaykr / oracle_cb
View on GitHub
Experimentation for oracle based contextual bandit algorithms.
☆33Sep 12, 2022Updated 3 years ago
alexrutar / banditvis
View on GitHub
A Python 3 Bandit Visualization Package
☆11Oct 16, 2017Updated 8 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
lilianweng / multi-armed-bandit
View on GitHub
Play with the solutions to the multi-armed-bandit problem.
☆418May 21, 2024Updated 2 years ago
google-research / policy-learning-landscape
View on GitHub
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Jan 16, 2019Updated 7 years ago
AdityaMate / collapsing_bandits
View on GitHub
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)
☆11Dec 3, 2025Updated 7 months ago
johnmyleswhite / BanditsBook
View on GitHub
Code for my book on Multi-Armed Bandit Algorithms
☆922Jan 9, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
conormm / bandit_algorithms
View on GitHub
☆26Sep 30, 2018Updated 7 years ago
LaunchpadAI / space-bandits
View on GitHub
☆106Sep 13, 2021Updated 4 years ago
Anton1o-I / thompson-sampling
View on GitHub
Implementation of Thompson Sampling in Python
☆15Feb 4, 2020Updated 6 years ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 8 years ago
ermongroup / best-arm-delayed
View on GitHub
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20Apr 3, 2018Updated 8 years ago
AnujMahajanOxf / VIREL
View on GitHub
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14Dec 1, 2019Updated 6 years ago
colby-j-wise / ParticleThompsonSamplingMAB
View on GitHub
Multi-Arm Bandits for online recommendations via Particle Thompson Sampling with Probabilistic Matrix Factorization
☆14May 9, 2018Updated 8 years ago
jimimvp / torch_rl
View on GitHub
Reinforcement learning library for PyTorch.
☆11Jun 15, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
iosband / TabulaRL
View on GitHub
☆66Mar 11, 2024Updated 2 years ago
ajgupta93 / d4pg-pytorch
View on GitHub
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19Jun 15, 2018Updated 8 years ago
mechanism-learning-research / two-player-auctions
View on GitHub
JAX/Haiku implementation of "Auction Learning as a Two-Player Game"
☆11Jul 6, 2024Updated 2 years ago
WilsonWangTHU / mbbl
View on GitHub
☆399Jul 18, 2019Updated 7 years ago
facebookresearch / RandomizedValueFunctions
View on GitHub
Randomized Value Functions via Multiplicative Normalizing Flows
☆18Jan 1, 2023Updated 3 years ago
HuasenWu / DuelingBandits
View on GitHub
Simulations for Dueling Bandit Algorithms, including our Double Thompson Sampling (D-TS) algorithms
☆25Sep 27, 2016Updated 9 years ago
akshaykhadse / reinforcement-learning
View on GitHub
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…
☆17May 21, 2018Updated 8 years ago
ntucllab / striatum
View on GitHub
Contextual bandit in python
☆112Jul 7, 2021Updated 5 years ago
abietti / cb_bakeoff
View on GitHub
scripts for evaluation of contextual bandit algorithms
☆46Apr 27, 2020Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
bstadie / third_person_im
View on GitHub
third person imitation learning. Archival only.
☆72Oct 22, 2019Updated 6 years ago
minatosato / deep-learning-theano
View on GitHub
Theano
☆11Aug 26, 2017Updated 8 years ago
andrewk1 / pytorch-deep-bayesian-bandits
View on GitHub
PyTorch port and extension of the Deep Bayesian Bandits Library
☆43Sep 4, 2019Updated 6 years ago
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 5 years ago
Continual-Lifelong-Learning / resources
View on GitHub
☆17Feb 21, 2020Updated 6 years ago
rrmenon10 / Bootstrapped-DQN
View on GitHub
Tensorflow implementation of BootstrappedDQN using OpenAI baselines
☆19Jan 12, 2021Updated 5 years ago
lxuechen / inference-suboptimality
View on GitHub
Code for "Inference Suboptimality in Variational Autoencoders"
☆16Mar 17, 2020Updated 6 years ago