aijunbai / taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
☆30Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for taxi
- A curated list of awesome Model-based reinforcement learning resources☆90Updated 4 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆32Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆15Updated 5 years ago
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆18Updated 2 years ago
- ☆97Updated last year
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Gym-like extensions for POMDP☆56Updated 3 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆30Updated last year
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- ☆43Updated last year
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆36Updated 3 years ago
- ☆53Updated 6 years ago
- Implementation of the Option-Critic Architecture☆36Updated 5 years ago
- Safe exploration in Markov Decision Processes☆38Updated 7 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆26Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆25Updated 4 years ago
- Hierarchical Deep RL Network☆30Updated 7 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆41Updated 5 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 5 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆36Updated 5 years ago
- ☆71Updated 5 months ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆55Updated 4 years ago
- ☆25Updated 6 years ago
- soft q learning and soft actor critic☆15Updated 5 years ago
- Residual policy learning☆58Updated 5 years ago