Cognitive-AI-Systems / MAPF-GPT-DDG
[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MAPF-GPT by introducing a novel fine-tuning method called Delta Data Generation (DDG) — a reward-free active learning approach that identifies and corrects failure cases in the policy.
☆60Updated 3 months ago
Alternatives and similar repositories for MAPF-GPT-DDG
Users who are interested in MAPF-GPT-DDG compare it to the repositories listed below.
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 10 months ago
- ☆20Updated 3 months ago
- Code for [ICML 2025] "Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design".☆63Updated 5 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆111Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆116Updated 2 weeks ago
- Explainable Reinforcement Learning (XRL) Resources☆44Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆69Updated 10 months ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆27Updated 8 months ago
- General multi-task deep RL Agent☆185Updated last year
- The original Shared Recurrent Memory Transformer implementation☆32Updated 3 months ago
- Repo to reproduce the First-Explore paper results☆38Updated 10 months ago
- MAexp is a generic platform for RL-based multi-agent exploration☆95Updated 2 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 9 months ago
- Implementation of RL-Enabled Distributed Assignment (REDA)☆22Updated last year
- ☆73Updated last year
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆90Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO☆64Updated last year
- PyTorch implementation of MuZero Unplugged for Gym environments. This algorithm is capable of supporting a wide range of action and observ…☆34Updated 4 months ago
- ☆23Updated last year
- Implementation of Soft Actor-Critic and some of its improvements in PyTorch☆60Updated 8 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆79Updated 5 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆44Updated 4 months ago
- A ray-based library of Distributed POPulation-based OPtimization for Large-Scale Black-Box Optimization.☆18Updated last year
- ☆14Updated last year
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆40Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 8 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆113Updated last year
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆68Updated last year