unixpickle / treeagent
Decision tree ensembles as RL policies
☆22, updated 8 years ago
Alternatives and similar repositories for treeagent
Users who are interested in treeagent are comparing it to the libraries listed below.
- Non stationary bandit for experiments with Reinforcement Learning ☆33, updated 8 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18, updated 8 years ago
- A lightweight python library for bandit algorithms ☆30, updated 3 years ago
- Library for Multi-Armed Bandit Algorithms ☆57, updated 8 years ago
- ☆209, updated 7 years ago
- OpenAI's cartpole env solver. ☆18, updated 7 years ago
- some common TD Learning algorithms ☆66, updated 5 years ago
- Contextual bandit in python ☆112, updated 4 years ago
- Temporally-reweighted Chinese restaurant process mixture models for multivariate time series ☆37, updated last year
- Contextual bandit benchmarking ☆53, updated 2 weeks ago
- Non-stationary Off-policy Evaluation ☆13, updated 7 years ago
- A Python library for reinforcement learning using Bayesian approaches ☆53, updated 10 years ago
- Parameter Importance Analysis Tool ☆77, updated 4 years ago
- ☆25, updated 6 years ago
- scripts for evaluation of contextual bandit algorithms ☆45, updated 5 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆55, updated 7 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax ☆191, updated 3 years ago
- A suite of boosting algorithms for the online learning setting. ☆67, updated 8 years ago
- A fast Evolution Strategy implementation in Python ☆272, updated 5 years ago
- Simple tools for statistical analyses in RL experiments ☆67, updated 7 years ago
- An implementation of the Augmented Random Search algorithm ☆427, updated 4 years ago
- RLgraph: Modular computation graphs for deep reinforcement learning ☆323, updated 6 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms) ☆22, updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51, updated last year
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems] ☆50, updated 6 years ago
- InfiniteBoost: building infinite ensembles with gradient descent ☆183, updated 7 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17, updated 7 years ago
- A reinforcement learning framework ☆157, updated 7 years ago
- Multi-Armed Bandit Algorithms Library (MAB) ☆134, updated 3 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library ☆280, updated last year