Cognitive-AI-Systems / MAPF-GPT
This repository contains MAPF-GPT, a deep learning-based model for solving MAPF problems. Trained with imitation learning on trajectories produced by LaCAM, it generates actions under partial observability without heuristics or agent communication. MAPF-GPT excels on unseen instances and outperforms state-of-the-art solvers.
☆50Updated 7 months ago
Alternatives and similar repositories for MAPF-GPT:
Users that are interested in MAPF-GPT are comparing it to the libraries listed below
- [AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF problems. Trained with imitation learning on …☆53Updated 3 months ago
- ☆16Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementation☆23Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆32Updated 5 months ago
- Code for ``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆37Updated 3 months ago
- [AAAI-2024] Follower: This study addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed Follow…☆39Updated 8 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆34Updated last month
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆66Updated 2 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- MAexp is a generic platform for RL-based multi-agent exploration☆80Updated 6 months ago
- ☆11Updated 11 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆87Updated last month
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆34Updated last month
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- [AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a…☆24Updated 8 months ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆55Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆24Updated 4 months ago
- ☆19Updated 7 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 6 months ago
- ☆13Updated 9 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆42Updated 4 months ago
- Explainable Reinforcement Learning (XRL) Resources☆37Updated 7 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆30Updated 6 months ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆52Updated 2 years ago
- ☆36Updated last month
- ☆27Updated 7 months ago