Cognitive-AI-Systems / MAPF-GPT-DDGLinks
[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MAPF-GPT by introducing a novel fine-tuning method called Delta Data Generation (DDG) — a reward-free active learning approach that identifies and corrects failure cases in the policy.
☆58Updated last month
Alternatives and similar repositories for MAPF-GPT-DDG
Users that are interested in MAPF-GPT-DDG are comparing it to the libraries listed below
Sorting:
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆25Updated 8 months ago
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆54Updated 3 months ago
- The original Shared Recurrent Memory Transformer implementation☆30Updated last month
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆100Updated last week
- ☆18Updated last month
- ☆23Updated 11 months ago
- ☆15Updated last year
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 6 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 10 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆106Updated 3 weeks ago
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆45Updated last month
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆33Updated last month
- ☆74Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 8 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆35Updated last year
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆39Updated 10 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆31Updated 2 months ago
- [AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF problems. Trained with imitation learning on …☆74Updated last month
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆42Updated 3 months ago
- Simplest and Cleanest DreamerV3 implementation out there☆87Updated 5 months ago
- A set of communication oriented environments☆14Updated last month
- Explainable Reinforcement Learning (XRL) Resources☆42Updated 11 months ago
- Implementation of RL-Enabled Distributed Assignment (REDA)☆22Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆59Updated 6 months ago
- ☆14Updated 6 months ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO☆62Updated 10 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆53Updated 2 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆45Updated last year