Cognitive-AI-Systems / pogema-toolboxLinks
☆17Updated last week
Alternatives and similar repositories for pogema-toolbox
Users that are interested in pogema-toolbox are comparing it to the libraries listed below
Sorting:
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆44Updated last week
- [AAAI-2024] Follower: This study addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed Follow…☆41Updated 10 months ago
- [AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF problems. Trained with imitation learning on …☆71Updated 3 weeks ago
- [ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specif…☆235Updated this week
- This is an umbrella repository that contains links and information about all the tools and algorithms related to the POGEMA Benchmark.☆25Updated 3 weeks ago
- ☆16Updated 10 months ago
- ☆16Updated 10 months ago
- The Hierarchical Intrinsically Motivated Agent (HIMA) is an algorithm that is intended to exhibit an adaptive goal-directed behavior usin…☆38Updated 4 months ago
- [AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a…☆24Updated 10 months ago
- This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforce…☆25Updated 6 months ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆74Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆51Updated 2 years ago
- ☆12Updated last year
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆42Updated 10 months ago
- Source code for Pathfinding in Stochastic Environments paper.☆14Updated 2 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆15Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆53Updated 2 years ago
- ☆22Updated last year
- JAX-based implementation for multi-agent path planning (MAPP) in continuous spaces.☆53Updated 2 years ago
- PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer☆10Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆53Updated 2 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆12Updated this week
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆25Updated 2 years ago
- On-Policy Policy Gradient Algorithms in JAX☆38Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 8 months ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆17Updated 2 years ago
- POPGym Library in JAX☆11Updated last year