[ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
☆52Apr 17, 2026Updated last month
Alternatives and similar repositories for MARSHAL
Users that are interested in MARSHAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments☆24Sep 30, 2025Updated 8 months ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆24Oct 8, 2025Updated 8 months ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- Repository for ICLR 2021 paper DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues☆17Feb 11, 2022Updated 4 years ago
- Code for EACL 26 Findings paper "I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search"☆13Jan 28, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization☆92Feb 20, 2026Updated 3 months ago
- Official repository for paper: T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexter…☆40May 10, 2026Updated last month
- ☆72Dec 7, 2025Updated 6 months ago
- ☆60May 21, 2025Updated last year
- ☆22Jul 22, 2025Updated 10 months ago
- A package to convert range data from ROS range topics to pointclouds☆10Jun 30, 2017Updated 8 years ago
- Code and data release for FEABench: Evaluating Language Models on Multiphysics Reasoning Ability. [MATH-AI workshop, NeurIPS 2024]☆14May 7, 2025Updated last year
- MATE: the Multi-Agent Tracking Environment.☆46Mar 31, 2023Updated 3 years ago
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆16Jun 1, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆46May 3, 2026Updated last month
- RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning☆19May 24, 2023Updated 3 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last month
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆38Dec 30, 2025Updated 5 months ago
- A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems☆13Mar 22, 2023Updated 3 years ago
- ☆16Apr 12, 2023Updated 3 years ago
- Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)☆10Jun 5, 2024Updated 2 years ago
- ☆22Feb 4, 2026Updated 4 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆65Apr 11, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆32Feb 19, 2025Updated last year
- Benchmarking of Neural Network Architectures in Reinforcement Learning.☆39Jan 22, 2026Updated 4 months ago
- This repo has the code and suplementary materials of our 2024 RAL submission.☆21Nov 23, 2025Updated 6 months ago
- Environment for coverage control and learning using GNN☆24Jun 18, 2025Updated 11 months ago
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆137Feb 10, 2026Updated 4 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆54Feb 10, 2025Updated last year
- ☆33Apr 28, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACM MM2025] Official code of " HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation"☆108Jul 23, 2025Updated 10 months ago
- ☆70Feb 4, 2026Updated 4 months ago
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024)☆19Apr 6, 2025Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆41Jul 21, 2025Updated 10 months ago
- CodeGuard+: Constrained Decoding for Secure Code Generation☆21Jul 30, 2024Updated last year
- ☆16May 5, 2022Updated 4 years ago
- ☆27Feb 24, 2024Updated 2 years ago