erensezener / aima-based-irl
IRL implementation based on Norvig's AIMA code.
☆14Updated 10 years ago
Related projects ⓘ
Alternatives and complementary repositories for aima-based-irl
- A Python library for reinforcement learning using Bayesian approaches☆53Updated 9 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆33Updated 8 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- ☆18Updated 9 years ago
- IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)☆62Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- ☆24Updated 9 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆43Updated 8 years ago
- ☆25Updated 7 years ago
- TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆65Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Collaborative filtering with the GP-LVM☆25Updated 9 years ago
- Variational Recurrent Auto Encoder☆16Updated 8 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Updated 6 years ago
- Variational Recurrent Auto-Encoder using LSTM encoder/decoder networks☆54Updated 8 years ago
- Analogs of Linguistic Structure in Deep Representations☆19Updated 7 years ago
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆43Updated 9 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Updated 8 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆30Updated 5 years ago
- ☆11Updated 3 years ago
- ☆28Updated 5 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Implementation of the Incremental Sequence Learning algorithms described in the Incremental Sequence Learning article☆41Updated 7 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago