yangmuzhi / airl
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
☆10 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for airl
- PyTorch implementation of InfoGAIL and WGAIL ☆18 · Updated 2 years ago
- PyTorch implementation of Multi-Agent Generative Adversarial Imitation Learning ☆37 · Updated 2 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆55 · Updated 2 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning ☆21 · Updated 4 years ago
- Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning) ☆30 · Updated 3 years ago
- PyTorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation ☆63 · Updated 3 years ago
- MetaLight: a value-based meta-reinforcement learning framework for traffic signal control ☆36 · Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 · Updated 4 years ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020) ☆55 · Updated 4 years ago
- Implementing the two pioneering IRL papers "Algorithms for Inverse Reinforcement Learning" (Ng & Russell 2000) and "Maximum Entropy Inve… ☆27 · Updated last year
- Adds CityFlow to Gym ☆27 · Updated 3 years ago
- Paper list for constrained policy optimization in reinforcement learning. ☆68 · Updated last year
- ☆61 · Updated last year
- [RA-L & ICRA 2021] Adversarial Inverse Reinforcement Learning with Self-attention Dynamics Model ☆31 · Updated 2 years ago
- Deep Implicit Coordination Graphs ☆41 · Updated 5 months ago
- ☆14 · Updated 8 months ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 · Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆51 · Updated last year
- ☆42 · Updated 3 years ago
- Assignments for CS294-112. ☆30 · Updated 5 years ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆66 · Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL ☆62 · Updated last year
- Implementation of PPO Lagrangian in PyTorch ☆35 · Updated 2 years ago
- ☆43 · Updated last year
- ☆83 · Updated 5 years ago
- Constrained Policy Optimization implementation on Safety Gym ☆21 · Updated 2 years ago
- ☆12 · Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 · Updated 4 years ago
- Code for the paper "Delay-Aware Multi-Agent Reinforcement Learning". ☆51 · Updated 4 years ago