dquail/NonStationaryBandit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dquail/NonStationaryBandit)

dquail / NonStationaryBandit

Non stationary bandit for experiments with Reinforcement Learning

☆34

Alternatives and similar repositories for NonStationaryBandit

Users that are interested in NonStationaryBandit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qingyun-wu / NonstationaryBanditLib
View on GitHub
☆15Jan 20, 2020Updated 6 years ago
python-neuroscience / python-for-neuroscience
View on GitHub
Python for Neuroscience - An introduction to scientific computing with Python
☆11Feb 3, 2016Updated 10 years ago
v-i-s-h / MAB.jl
View on GitHub
A Julia Package for providing Multi Armed Bandit Experiments
☆21Jul 19, 2018Updated 8 years ago
rougier / Scipy-Bordeaux-2017
View on GitHub
Course taught at the University of Bordeaux in the academic year 2017 for PhD students.
☆17Feb 6, 2017Updated 9 years ago
gdmarmerola / advanced-bandit-problems
View on GitHub
More about the exploration-exploitation tradeoff with harder bandits
☆24May 12, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
fancyspeed / semi-lda
View on GitHub
Semi-supervised Latent Dirichlet Allocation (LDA)
☆12Dec 21, 2017Updated 8 years ago
hartikainen / rl-graph-signal-recovery
View on GitHub
An attempt to apply reinforcement learning to graph signal recovery problem
☆11Aug 25, 2021Updated 4 years ago
osome-iu / ChatGPT_domain_rating
View on GitHub
Code and data for paper "Large language models can rate news outlet credibility"
☆13Aug 10, 2024Updated last year
ludobouan / Q-learning-gridworld
View on GitHub
Reinforcement learning on gridworld with Q-learning
☆10Jan 28, 2017Updated 9 years ago
rougier / ASPP-2017
View on GitHub
Material for the Advanced Scientific Python Programming course, Nikiti, Greece, 2017
☆14Aug 26, 2017Updated 8 years ago
hollygrimm / cs294-homework
View on GitHub
Assignments for CS294-112.
☆17Jul 13, 2018Updated 8 years ago
ardaegeunlu / X-armed-Bandits
View on GitHub
Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.
☆11Jul 12, 2018Updated 8 years ago
xeniaqian94 / RLeToR
View on GitHub
A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…
☆18Dec 8, 2017Updated 8 years ago
prideout / sympy-fun
View on GitHub
use SymPy to generate equations for parametric surfaces
☆18May 7, 2012Updated 14 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
alexrutar / banditvis
View on GitHub
A Python 3 Bandit Visualization Package
☆11Oct 16, 2017Updated 8 years ago
roycoding / slots
View on GitHub
A multi-armed bandit library for Python
☆81Jan 13, 2020Updated 6 years ago
econtal / gp-optimization-python
View on GitHub
Implementation of my Bayesian Optimization algorithms
☆12Mar 17, 2018Updated 8 years ago
jerrylin1121 / BCO
View on GitHub
Implementation of Behavioral Cloning from Observationmentation
☆16Nov 28, 2019Updated 6 years ago
PacktPublishing / Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL
View on GitHub
Hands-On Reinforcement Learning with TensorFlow & TRFL
☆14Jan 18, 2021Updated 5 years ago
j-wang / BanditEmpirical
View on GitHub
Empirical tests of various bandit algorithms.
☆16Dec 6, 2014Updated 11 years ago
ReScience / call-for-replication
View on GitHub
Call for Replication in ReScience
☆13Oct 13, 2016Updated 9 years ago
rjagerman / wsdm2019-nonstationary
View on GitHub
Non-stationary Off-policy Evaluation
☆13Nov 8, 2018Updated 7 years ago
Underflow / reinforcement-2048
View on GitHub
A reinforcement learning algorithm for the 2048 game
☆20Mar 25, 2014Updated 12 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
tushuhei / gpucb
View on GitHub
Simple implementation of GP-UCB algorithm.
☆55Jan 20, 2017Updated 9 years ago
FrankVeenstra / gym_rem2D
View on GitHub
OpenAI gym environment for evolving morphologies of 2D virtual creatures.
☆34Jul 26, 2023Updated 3 years ago
ardaegeunlu / Contextual-Gaussian-Process-Bandit-Optimization
View on GitHub
Simple implementation of the CGP-UCB algorithm.
☆39Nov 30, 2019Updated 6 years ago
NickTrossa / OCR7SD
View on GitHub
Optical Character Recognition of Seven Segment Display
☆14May 25, 2019Updated 7 years ago
rougier / emacs-octicons
View on GitHub
Octicons glyph name for emacs
☆24Dec 24, 2015Updated 10 years ago
gemst1 / IRL
View on GitHub
Inverse Reinforcement Learning, Inverse Optimal Control, Apprenticeship Learning, Imitation Learning review
☆46Apr 27, 2021Updated 5 years ago
j2kun / exp3
View on GitHub
Python code for the post "Adversarial Bandits and the Exp3 Algorithm"
☆51Jun 9, 2020Updated 6 years ago
hauselin / domain-quality-ratings
View on GitHub
Comprehensive database of ratings for 11k news domains
☆29Sep 14, 2023Updated 2 years ago
david-abel / state_abstraction
View on GitHub
Code for abstracting, evaluating, and visualizing Markov Decision Processes.
☆10Jan 12, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ancorso / POMDPGym.jl
View on GitHub
☆12Nov 26, 2025Updated 8 months ago
rougier / dynamic-som
View on GitHub
Dynamic Self-Organized maps
☆22Nov 11, 2015Updated 10 years ago
sustainable-computing / COBS
View on GitHub
COBS: COmprehensive Building Simulator
☆16Jun 23, 2022Updated 4 years ago
bgalbraith / bandits
View on GitHub
Python library for Multi-Armed Bandits
☆771Feb 11, 2020Updated 6 years ago
sbarratt / nips2017
View on GitHub
☆14Dec 10, 2017Updated 8 years ago
brunoscherrer / retraites
View on GitHub
simulateur du COR amélioré
☆25Sep 30, 2020Updated 5 years ago
nel215 / change_finder
View on GitHub
☆11Dec 26, 2022Updated 3 years ago