davide97l / PPO-GAIL-cartpoleLinks
GAIL learning to imitate PPO playing CartPole.
☆12Updated 4 years ago
Alternatives and similar repositories for PPO-GAIL-cartpole
Users that are interested in PPO-GAIL-cartpole are comparing it to the libraries listed below
Sorting:
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆21Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)☆13Updated 6 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 8 months ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆23Updated 3 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆31Updated 3 weeks ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆35Updated 3 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆40Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 4 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- An unofficial implementation for online decision transformer☆40Updated 3 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- DecentralizedLearning☆24Updated 3 years ago
- MATE: the Multi-Agent Tracking Environment.☆48Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 3 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 9 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆90Updated 5 years ago
- ☆40Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆26Updated 2 years ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆58Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year