Non stationary bandit for experiments with Reinforcement Learning
☆33Mar 24, 2017Updated 8 years ago
Alternatives and similar repositories for NonStationaryBandit
Users that are interested in NonStationaryBandit are comparing it to the libraries listed below
Sorting:
- Python for Neuroscience - An introduction to scientific computing with Python☆11Feb 3, 2016Updated 10 years ago
- Course taught at the University of Bordeaux in the academic year 2017 for PhD students.☆17Feb 6, 2017Updated 9 years ago
- ☆15Jan 20, 2020Updated 6 years ago
- Notes for the Scientific Python course at the university of Bordeaux☆21Feb 27, 2018Updated 8 years ago
- A Julia Package for providing Multi Armed Bandit Experiments☆21Jul 19, 2018Updated 7 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- Reinforcement learning on gridworld with Q-learning☆10Jan 28, 2017Updated 9 years ago
- Material for the Advanced Scientific Python Programming course, Nikiti, Greece, 2017☆14Aug 26, 2017Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Call for Replication in ReScience☆13Oct 13, 2016Updated 9 years ago
- A reinforcement learning algorithm for the 2048 game☆20Mar 25, 2014Updated 11 years ago
- Restructured Text Bootstrap☆20Mar 3, 2018Updated 8 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Assignments for CS294-112.☆16Jul 13, 2018Updated 7 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆23Aug 15, 2019Updated 6 years ago
- Course taught at the University of Bordeaux in the academic year 2015/16 for PhD students.☆21Mar 17, 2016Updated 9 years ago
- Code for "An Online Algorithm to Reduce the Spread of Misinformation in Social Networks", WSDM 2018☆27Jan 8, 2018Updated 8 years ago
- A multi-armed bandit library for Python☆81Jan 13, 2020Updated 6 years ago
- Pandas Msgpack☆24Jul 21, 2022Updated 3 years ago
- Python code for the post "Adversarial Bandits and the Exp3 Algorithm"☆50Jun 9, 2020Updated 5 years ago
- Dynamic Self-Organized maps☆22Nov 11, 2015Updated 10 years ago
- Octicons glyph name for emacs☆24Dec 24, 2015Updated 10 years ago
- simulateur du COR amélioré☆25Sep 30, 2020Updated 5 years ago
- Repository for the CLiPS HAte speech DEtection System [HADES].☆24Apr 5, 2018Updated 7 years ago
- Comprehensive database of ratings for 11k news domains☆28Sep 14, 2023Updated 2 years ago
- Public repository for the work on bandit problems☆24Apr 4, 2024Updated last year
- OpenAI gym environment for evolving morphologies of 2D virtual creatures.☆34Jul 26, 2023Updated 2 years ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- Top 10 LaTeX fonts☆64Oct 6, 2014Updated 11 years ago
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- The code for the post "Optimism in the Face of Uncertainty: the UCB1 Algorithm"☆37Jun 9, 2020Updated 5 years ago
- One hour lecture to introduce LaTeX to maths undergraduates.☆10Oct 2, 2020Updated 5 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- Pogobot is an open-source open-hardware robotic platform for swarm robotics☆16Feb 27, 2026Updated last week
- Galaxy is a lightweight software deployment and management tool. We use it at Ning to manage the Java cores and Apache httpd instances th…☆21Sep 11, 2011Updated 14 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago