Cognitive-AI-Systems / pogema-toolboxLinks
☆18Updated last month
Alternatives and similar repositories for pogema-toolbox
Users that are interested in pogema-toolbox are comparing it to the libraries listed below
Sorting:
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆45Updated last month
- [AAAI-2024] Follower: This study addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed Follow…☆42Updated last month
- This is an umbrella repository that contains links and information about all the tools and algorithms related to the POGEMA Benchmark.☆25Updated 2 months ago
- [AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF problems. Trained with imitation learning on …☆74Updated last month
- ☆16Updated 11 months ago
- ☆16Updated 11 months ago
- [ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specif…☆238Updated last month
- This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforce…☆25Updated 8 months ago
- The Hierarchical Intrinsically Motivated Agent (HIMA) is an algorithm that is intended to exhibit an adaptive goal-directed behavior usin…☆37Updated 6 months ago
- [AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a…☆25Updated last month
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆75Updated 2 years ago
- JAX-based implementation for multi-agent path planning (MAPP) in continuous spaces.☆52Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 10 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆30Updated last year
- ☆36Updated 2 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆53Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- POPGym Library in JAX☆11Updated last year
- On-Policy Policy Gradient Algorithms in JAX☆39Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆15Updated 2 weeks ago
- ☆12Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆37Updated 5 months ago
- PPO and PyMARL baseline for Pogema environment☆22Updated 11 months ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- ☆22Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆52Updated 2 years ago
- Modular and Hierachical RL baseline solution for the IGLU RL track @ NeurIPS 2022☆20Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- Source code for Pathfinding in Stochastic Environments paper.☆14Updated 2 years ago