Kirili4ik / HRL-taxiLinks

Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)

☆21

Alternatives and similar repositories for HRL-taxi

Users that are interested in HRL-taxi are comparing it to the libraries listed below

Sorting:

uoe-agents / robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
☆65Updated 9 months ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
seolhokim / InverseRL-Pytorch
Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation
☆65Updated 4 years ago
BY571 / SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆53Updated 3 years ago
kantologist / multiagent-sac
Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.
☆37Updated 4 years ago
011235813 / cm3
Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning
☆56Updated 3 years ago
SaminYeasar / Off_Policy_Adversarial_Inverse_Reinforcement_Learning
Implementation of Off Policy Adversarial Inverse Reinforcement Learning
☆23Updated 4 years ago
parametersharingmadrl / parametersharingmadrl
☆28Updated 4 years ago
xtma / dsac
Distributional Soft Actor Critic
☆55Updated 5 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆51Updated 4 months ago
puyuan1996 / MARL
Implementation for mSAC methods in PyTorch
☆42Updated 3 years ago
Felhof / DiscreteSAC
☆40Updated 3 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 3 months ago
sanmuyang / multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
☆71Updated 3 years ago
baimingc / delay-aware-MARL
Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".
☆53Updated 4 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119Updated 7 months ago
sisl / DICG
Deep Implicit Coordination Graphs
☆41Updated last year
matteokarldonati / Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
☆58Updated 5 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆128Updated 10 months ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆56Updated 3 years ago
hijkzzz / noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
☆64Updated 2 years ago
Stepan-Makarenko / ICM-PPO-implementation
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
☆18Updated last year
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆85Updated last year
MDrW / ICML2022-IRAT
☆39Updated 2 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 5 years ago
AlgTUDelft / WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆55Updated last year
esmeralday / MARL
Multi-Agent Reinforcement Learning
☆11Updated 5 years ago
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91Updated last year
uoe-agents / lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
☆46Updated 9 months ago