Non-stationary bandit for experiments with Reinforcement Learning
☆33Mar 24, 2017Updated 9 years ago
Alternatives and similar repositories for NonStationaryBandit
Users that are interested in NonStationaryBandit are comparing it to the libraries listed below.
- ☆15Jan 20, 2020Updated 6 years ago
- A Julia Package for providing Multi Armed Bandit Experiments☆21Jul 19, 2018Updated 7 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Jun 6, 2018Updated 8 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- An attempt to apply reinforcement learning to graph signal recovery problem☆11Aug 25, 2021Updated 4 years ago
- Reinforcement learning on gridworld with Q-learning☆10Jan 28, 2017Updated 9 years ago
- Material for the Advanced Scientific Python Programming course, Nikiti, Greece, 2017☆14Aug 26, 2017Updated 8 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- Implementation of my Bayesian Optimization algorithms☆12Mar 17, 2018Updated 8 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆32Jul 6, 2017Updated 8 years ago
- Implementation of Behavioral Cloning from Observation☆16Nov 28, 2019Updated 6 years ago
- A Python implementation of the spectral projected gradient (SPG) optimization method☆12Mar 24, 2014Updated 12 years ago
- Hierarchical Dirichlet Process (with Split-Merge Operations), originally by Chong Wang☆18Oct 12, 2013Updated 12 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Restructured Text Bootstrap☆20Mar 3, 2018Updated 8 years ago
- Simple implementation of GP-UCB algorithm.☆55Jan 20, 2017Updated 9 years ago
- OpenAI gym environment for evolving morphologies of 2D virtual creatures.☆34Jul 26, 2023Updated 2 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆23Aug 15, 2019Updated 6 years ago
- Course taught at the University of Bordeaux in the academic year 2015/16 for PhD students.☆21Mar 17, 2016Updated 10 years ago
- Linear UCB bandit learning algorithm (L. Li, 2010), Python code☆19Oct 6, 2014Updated 11 years ago
- Python code for the post "Adversarial Bandits and the Exp3 Algorithm"☆51Jun 9, 2020Updated 6 years ago
- Collection of useful, re-used routines.☆45Jul 15, 2017Updated 8 years ago
- Comprehensive database of ratings for 11k news domains☆29Sep 14, 2023Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- Python library for Multi-Armed Bandits☆770Feb 11, 2020Updated 6 years ago
- Code from my blog post & online course☆55Jul 7, 2019Updated 6 years ago
- ☆14Dec 10, 2017Updated 8 years ago
- ROBEL: Robotics Benchmarks for Learning with low-cost robots (dev fork)☆13Jul 30, 2020Updated 5 years ago
- ☆12Dec 8, 2016Updated 9 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- A Docutils writer for converting from reStructuredText documents to Markdown.☆48Mar 16, 2019Updated 7 years ago
- We conduct a preregistered experiment to investigate whether fact checks provided by a large language model can serve as an effective mis…☆13Dec 14, 2024Updated last year
- Various tools for EEG/MEG data analysis.☆10Updated this week
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Oct 18, 2019Updated 6 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 16 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- Implementations on OpenAI's Gym☆10Nov 21, 2017Updated 8 years ago
- NavCog is an example app of blelocpp library aimed specifically for the blind to help those people “explore” the world without vision. No…☆10Jan 18, 2017Updated 9 years ago