An experiment with Thompson sampling and TD(0) on a grid world variant
☆17Nov 8, 2013Updated 12 years ago
Alternatives and similar repositories for tstd0
Users that are interested in tstd0 are comparing it to the libraries listed below
Sorting:
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Dec 30, 2014Updated 11 years ago
- Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.☆13May 24, 2019Updated 6 years ago
- ☆16Jun 23, 2015Updated 10 years ago
- Hybrid Linear UCB Multi-arm Bandit library☆14Oct 5, 2016Updated 9 years ago
- Selective Bayesian Forest Classifier - R package for simultaneous feature selection and classification. See paper: http://arxiv.org/abs/1…☆16Jan 15, 2022Updated 4 years ago
- Elm Web Development, published by Packt☆13Oct 31, 2022Updated 3 years ago
- Python implementation of UCB, EXP3 and Epsilon greedy algorithms☆30Oct 4, 2018Updated 7 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Dec 18, 2013Updated 12 years ago
- Code for the DeepScript Submission to ICFHR2016 Competition on the Classification of Medieval Handwritings in Latin Script☆18Nov 23, 2016Updated 9 years ago
- C++ implementation of a b-tree.☆13Aug 4, 2022Updated 3 years ago
- An asynchronous Redis client for Tornado☆20Nov 6, 2020Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- Large scale matrix factorization on GPU☆19Jun 4, 2016Updated 9 years ago
- training ternary neural networks☆15Apr 18, 2017Updated 8 years ago
- SDK for creating waPC WebAssembly Guest Modules in Zig☆14Dec 27, 2021Updated 4 years ago
- Generative models and other stuff too, maybe, perhaps even probably☆16Dec 12, 2015Updated 10 years ago
- Reinforcement Learning Algorithm for Packet Routing☆12Aug 20, 2020Updated 5 years ago
- Simple persistent storage for C++ objects using virtual memory mapping mechanism☆18Nov 23, 2009Updated 16 years ago
- Command-line JSON processor☆14Oct 23, 2019Updated 6 years ago
- ☆19Oct 26, 2023Updated 2 years ago
- Eliminate global state without the boilerplate!☆13Dec 18, 2018Updated 7 years ago
- Tunnel is a clean wrapper around native Go channel to allow cleanly closing the channel without throwing a panic.☆13Aug 1, 2019Updated 6 years ago
- 贝叶斯思维☆15Jan 5, 2019Updated 7 years ago
- Leave No Trace is an algorithm for safe reinforcement learning.☆15Apr 30, 2018Updated 7 years ago
- Notes on Logistic Regression and OWLQN☆26Apr 8, 2017Updated 8 years ago
- Examples of different methods to compose FaaS functions together☆10Jul 18, 2018Updated 7 years ago
- Dumb implementation of monads in Python.☆16May 5, 2015Updated 10 years ago
- Code for ICML2020 "Sequence Generation with Mixed Representations"☆12Jun 27, 2020Updated 5 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- A Bayesian Data Augmentation Approach for Learning Deep Models in Keras. Here is the link to a pytorch version: https://github.com/toant…☆25Oct 4, 2017Updated 8 years ago
- homework for shenlan's "Motion Planning For Mobile Robots "☆15May 14, 2020Updated 5 years ago
- This repo contain the exercies of the Next.ML 2015 presentation☆24Jan 17, 2015Updated 11 years ago
- Cross-platform socketpair functionality☆16May 15, 2025Updated 10 months ago
- This is a read-only mirror of the CRAN R package repository. speedglm — Fitting Linear and Generalized Linear Models to Large Data Sets…☆10May 6, 2023Updated 2 years ago
- A distributed heart rate monitor using Microsoft Band, Raspberry PI2, and Windows 10 UWP, Azure and Signal/R☆11Jul 15, 2015Updated 10 years ago
- Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.☆14Jul 16, 2018Updated 7 years ago
- [Starter project] web server & client. Fully C++/WebAssembly. Server runs on google cloud function. Client uses a C++ virtual dom.☆11Jun 10, 2019Updated 6 years ago
- Giza++☆12May 12, 2015Updated 10 years ago
- Exposes the haproxy command socket over TCP using socat☆13May 28, 2020Updated 5 years ago