aijunbai / taxiLinks

Hierarchical Online Planning and Reinforcement Learning on Taxi

☆30

Alternatives and similar repositories for taxi

Users that are interested in taxi are comparing it to the libraries listed below

Sorting:

mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆37Updated 4 years ago
hejia-zhang / awesome-model-based-reinforcement-learning
A curated list of awesome Model-based reinforcement learning resources
☆94Updated 4 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆31Updated 2 years ago
d3sm0 / gym_pomdp
Gym-like extensions for POMDP
☆57Updated 4 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆39Updated 6 years ago
gjp1203 / nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆33Updated 6 years ago
ermongroup / multiagent-gail
☆83Updated 6 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
montrealrobotics / active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
☆98Updated 4 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
jangirrishabh / Overcoming-exploration-from-demos
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…
☆154Updated 3 years ago
Pearl-UTexas / ActiveVaR
Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"
☆16Updated 6 years ago
theophilegervet / options-hierarchical-rl
☆26Updated 7 years ago
mcgillmrl / kusanagi
Library for model based RL in robotics
☆37Updated 6 years ago
dibyaghosh / dnc
Code for "Divide-and-Conquer Reinforcement Learning"
☆61Updated 6 years ago
liuzuxin / safe-mbrl
Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method
☆66Updated 2 years ago
syuntoku14 / pytorch-rl-il
A library for building reinforcement learning and imitation learning agents in Pytorch
☆59Updated 5 years ago
Stanford-ILIAD / batch-active-preference-based-learning
Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…
☆29Updated 6 years ago
wendelinboehmer / dcg
☆76Updated last year
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆26Updated 5 years ago
twni2016 / f-IRL
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
☆45Updated last year
laurimi / npgi
Non-linear policy graph improvement - planning for Dec-POMDPs
☆16Updated 4 years ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago