PKU-MARL / TRPO-PPO-in-MARLView external linksLinks
☆16May 5, 2022Updated 3 years ago
Alternatives and similar repositories for TRPO-PPO-in-MARL
Users that are interested in TRPO-PPO-in-MARL are comparing it to the libraries listed below
Sorting:
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Apr 25, 2024Updated last year
- The Starcraft Multi-Agent challenge lite☆47Sep 13, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆21Apr 26, 2023Updated 2 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- ☆25Apr 16, 2024Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Propose & vote on reading group papers in the "Discussions" tab.☆12Feb 20, 2024Updated last year
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".☆13Jan 25, 2023Updated 3 years ago
- Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described in the paper presented in the Machine Learning for Autonomous …☆24Mar 6, 2021Updated 4 years ago
- ☆59Sep 22, 2022Updated 3 years ago
- 新增一个CBF层,并将其结合进actor网络中,得到safe RL框架。后续验证中发现这种做法并没有实质性的用处,所以不再继续这个项目☆11Mar 14, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 2 years ago
- Hello (Real) World with ROS – Robot Operating System course ROS environment☆11Mar 15, 2021Updated 4 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38May 16, 2023Updated 2 years ago
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning☆13Oct 19, 2022Updated 3 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 2 years ago
- A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems☆13Mar 22, 2023Updated 2 years ago
- Fast reinforcement learning research☆61Dec 7, 2024Updated last year
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 3 years ago
- ☆13Apr 25, 2023Updated 2 years ago
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆20Dec 1, 2025Updated 2 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆71Jun 13, 2024Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82May 13, 2024Updated last year
- ☆17Oct 18, 2022Updated 3 years ago
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆17Feb 15, 2023Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams☆51Jan 11, 2022Updated 4 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Apr 13, 2023Updated 2 years ago
- Vectorization techniques for fast population-based training.☆57Aug 12, 2022Updated 3 years ago
- ☆91Jan 21, 2026Updated 3 weeks ago