unixpickle / treeagent
Decision tree ensembles as RL policies
☆22 Updated 7 years ago
Alternatives and similar repositories for treeagent
Users who are interested in treeagent are comparing it to the libraries listed below.
- Non stationary bandit for experiments with Reinforcement Learning ☆34 Updated 8 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 Updated 8 years ago
- Library for Multi-Armed Bandit Algorithms ☆56 Updated 8 years ago
- Contextual bandit benchmarking ☆50 Updated 2 months ago
- Parameter Importance Analysis Tool ☆77 Updated 4 years ago
- Contextual bandit in python ☆114 Updated 4 years ago
- Bandits Environments for the OpenAI Gym ☆89 Updated 5 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 Updated 7 years ago
- some common TD Learning algorithms ☆66 Updated 5 years ago
- A Python library for reinforcement learning using Bayesian approaches ☆54 Updated 10 years ago
- OpenAI's cartpole env solver. ☆18 Updated 6 years ago
- A lightweight python library for bandit algorithms ☆30 Updated 3 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆55 Updated 6 years ago
- A fast Evolution Strategy implementation in Python ☆271 Updated 5 years ago
- Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea… ☆44 Updated 7 years ago
- scripts for evaluation of contextual bandit algorithms ☆45 Updated 5 years ago
- Experimentation for oracle based contextual bandit algorithms. ☆32 Updated 2 years ago
- A reinforcement learning framework ☆156 Updated 6 years ago
- Multi-armed bandit simulation library ☆139 Updated last year
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update ☆75 Updated 9 months ago
- ☆69 Updated 7 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated last year
- A bot for financial signal ☆62 Updated 7 years ago
- ☆209 Updated 7 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆34 Updated 9 years ago
- reinforcement learning. policy gradient. PCL ☆37 Updated 8 years ago
- Temporally-reweighted Chinese restaurant process mixture models for multivariate time series ☆37 Updated last year
- A suite of boosting algorithms for the online learning setting. ☆65 Updated 8 years ago
- Files for Python Talk ☆24 Updated 9 years ago
- Efficient Hyperparameter Optimization of Deep Learning Algorithms Using Deterministic RBF Surrogates ☆115 Updated 8 years ago