erensezener / aima-based-irl
IRL implementation based on Norvig's AIMA code.
☆13Updated 10 years ago
Alternatives and similar repositories for aima-based-irl:
Users that are interested in aima-based-irl are comparing it to the libraries listed below
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 9 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- Gopalan, P., Ruiz, F. J., Ranganath, R., & Blei, D. M. (2014). Bayesian Nonparametric Poisson Factorization for Recommendation Systems. I…☆15Updated 10 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆32Updated 8 years ago
- Collaborative filtering with the GP-LVM☆25Updated 9 years ago
- ☆27Updated 5 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- ☆11Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 6 years ago
- ☆25Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Updated 7 years ago
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆42Updated 10 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆19Updated 7 years ago
- ☆24Updated 9 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code for "Boosted Generative Models", AAAI 2018.☆20Updated 7 years ago
- ☆31Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆30Updated 7 years ago
- ☆16Updated 8 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)☆62Updated 8 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Dagger - An implementation of Dataset Aggregation☆30Updated 6 years ago
- Code for a generative controller for the AI Gym cartpole task☆15Updated 8 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago