aijunbai / taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
☆30 Updated 7 years ago
Alternatives and similar repositories for taxi:
Users who are interested in taxi are comparing it to the repositories listed below:
- Gym-like extensions for POMDP ☆57 Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- ☆25 Updated 7 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆65 Updated 5 years ago
- A curated list of awesome Model-based reinforcement learning resources ☆93 Updated 4 years ago
- ☆84 Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆87 Updated 7 years ago
- ☆53 Updated 7 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆27 Updated 5 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆32 Updated 2 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762) ☆97 Updated 4 years ago
- ☆97 Updated last year
- Library for model based RL in robotics ☆37 Updated 6 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- QMDP-Net implementation ☆65 Updated 4 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L… ☆29 Updated 5 years ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning" ☆16 Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning" ☆61 Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆47 Updated 5 years ago
- Hierarchical Deep RL Network ☆31 Updated 8 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments) ☆36 Updated 4 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020 ☆45 Updated last year
- ☆66 Updated 4 years ago
- ☆91 Updated last year
- Residual policy learning ☆60 Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/) ☆62 Updated 8 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆54 Updated 2 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆64 Updated last year
- NeurIPS Reproducibility Challenge 2019 ☆20 Updated 5 years ago