some common TD Learning algorithms
☆66 Mar 6, 2020 Updated 5 years ago
Alternatives and similar repositories for tdlearn
Users who are interested in tdlearn are comparing it to the libraries listed below.
- Some Reinforcement Learning in Python ☆115 Apr 17, 2017 Updated 8 years ago
- A python implementation of tile coding using numpy. ☆11 May 13, 2017 Updated 8 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009 ☆11 Dec 30, 2014 Updated 11 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant ☆17 Nov 8, 2013 Updated 12 years ago
- Latent Dirichlet Allocation with Gibbs sampling ☆16 Dec 18, 2013 Updated 12 years ago
- Learning to Reinforcement Learn ☆11 Nov 22, 2022 Updated 3 years ago
- A collection of deep reinforcement learning-based & GFlowNet drug molecule generators focused on generation of molecules using Graphs/SEL… ☆10 Dec 11, 2022 Updated 3 years ago
- ☆13 Jan 19, 2017 Updated 9 years ago
- ☆160 Jul 21, 2017 Updated 8 years ago
- Ball & beam OpenAI gym environments ☆15 Mar 4, 2020 Updated 5 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆133 May 5, 2019 Updated 6 years ago
- Code for the DeepScript Submission to ICFHR2016 Competition on the Classification of Medieval Handwritings in Latin Script ☆18 Nov 23, 2016 Updated 9 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆37 Oct 14, 2020 Updated 5 years ago
- Emergent Communication of Generalizations, NeurIPS 2021 ☆13 Sep 29, 2021 Updated 4 years ago
- Selective Bayesian Forest Classifier - R package for simultaneous feature selection and classification. See paper: http://arxiv.org/abs/1… ☆16 Jan 15, 2022 Updated 4 years ago
- LfD: Learning from Demonstrations for Robotic Manipulation ☆47 Jul 30, 2015 Updated 10 years ago
- Contextual Relative Entropy Policy Search for Reinforcement Learning in Python ☆15 Oct 17, 2018 Updated 7 years ago
- Code to compute the Stein discrepancy between a sample distribution and its target ☆17 Jul 17, 2017 Updated 8 years ago
- C implementation of RL and IRL algorithms ☆19 Jul 6, 2020 Updated 5 years ago
- ☆101 Aug 15, 2016 Updated 9 years ago
- Generative models and other stuff too, maybe, perhaps even probably ☆16 Dec 12, 2015 Updated 10 years ago
- NPHC ☆17 May 30, 2021 Updated 4 years ago
- Large scale matrix factorization on GPU ☆19 Jun 4, 2016 Updated 9 years ago
- An asynchronous Redis client for Tornado ☆20 Nov 6, 2020 Updated 5 years ago
- A parallel version of Trust Region Policy Optimization ☆65 Mar 6, 2017 Updated 8 years ago
- Implementation of the Self Paced Reinforcement Learning Experiments ☆19 Sep 27, 2023 Updated 2 years ago
- ☆25 Jan 2, 2019 Updated 7 years ago
- Code to accompany the paper "k-Stochastic Neighbor Embeddings for Supervised and Unsupervised Learning, ICML 2013". ☆27 Jun 8, 2016 Updated 9 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆46 Sep 20, 2023 Updated 2 years ago
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization. ☆17 Feb 17, 2021 Updated 5 years ago
- Simulating Language Course ☆24 Apr 1, 2019 Updated 6 years ago
- Click through rate prediction ☆19 Feb 14, 2017 Updated 9 years ago
- A library to benchmark reinforcement learning algorithms ☆21 Apr 18, 2018 Updated 7 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021 ☆24 Jun 4, 2021 Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 Apr 1, 2022 Updated 3 years ago
- RLPy Reinforcement Learning Framework ☆254 Sep 29, 2019 Updated 6 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython ☆26 Oct 13, 2022 Updated 3 years ago
- Python implementation of Markov Jump Hamiltonian Monte Carlo ☆25 Feb 9, 2017 Updated 9 years ago
- L1 Trend Filtering ☆19 Apr 17, 2024 Updated last year