erensezener / aima-based-irl
IRL implementation based on Norvig's AIMA code.
☆13Updated 10 years ago
Alternatives and similar repositories for aima-based-irl:
Users that are interested in aima-based-irl are comparing it to the libraries listed below
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- A Python library for reinforcement learning using Bayesian approaches☆54Updated 9 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Updated 7 years ago
- ☆25Updated 7 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆32Updated 8 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Code for a generative controller for the AI Gym cartpole task☆15Updated 7 years ago
- IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)☆62Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆34Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- ☆68Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- ☆28Updated 5 years ago
- Train an RL agent to play multiple Atari games at once☆70Updated 8 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- ☆26Updated 5 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆42Updated 9 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 8 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- ☆13Updated 9 years ago
- Reading Group on Reinforcement Learning topics☆55Updated 8 years ago