MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
☆45Apr 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for MARSHAL
Users that are interested in MARSHAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments☆22Sep 30, 2025Updated 7 months ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆24Oct 8, 2025Updated 6 months ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- Harness for deep search agent☆85Apr 27, 2026Updated last week
- Code for EACL 26 Findings paper "I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search"☆12Jan 28, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization☆90Feb 20, 2026Updated 2 months ago
- ☆70Dec 7, 2025Updated 4 months ago
- ☆22Jul 22, 2025Updated 9 months ago
- Python tools for working with LAMMPS files☆16Updated this week
- MCP Atlas☆74Apr 24, 2026Updated last week
- A package to convert range data from ROS range topics to pointclouds☆10Jun 30, 2017Updated 8 years ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 7 months ago
- Verifying the optimization phases of the GraalVM compiler☆14Jan 13, 2025Updated last year
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆23Aug 1, 2025Updated 9 months ago
- Code and data release for FEABench: Evaluating Language Models on Multiphysics Reasoning Ability. [MATH-AI workshop, NeurIPS 2024]☆13May 7, 2025Updated 11 months ago
- MATE: the Multi-Agent Tracking Environment.☆43Mar 31, 2023Updated 3 years ago
- ESfP: Event-based Shape from Polarization (CVPR 2023)☆18May 9, 2023Updated 2 years ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- ☆41Jan 25, 2026Updated 3 months ago
- AutoVRL is an open-source high fidelity simulator for simulation to real-world autonomous ground vehicle deep reinforcement learning rese…☆12Apr 26, 2023Updated 3 years ago
- Template repository for generating semantic maps☆16Feb 4, 2019Updated 7 years ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Apr 9, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated last week
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆36Dec 30, 2025Updated 4 months ago
- A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems☆13Mar 22, 2023Updated 3 years ago
- ☆16Apr 12, 2023Updated 3 years ago
- An interactive design platform for 3D-printed multi-layer microfluidic chips with design-for-manufacturing function☆22May 22, 2024Updated last year
- Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)☆10Jun 5, 2024Updated last year
- Computational Chemistry Data Management Library for Machine Learning Force Field Development☆21Apr 21, 2026Updated last week
- Code for extrapolation in materials property prediction as proposed in "Known Unknowns: Out-of-Distribution Property Prediction in Materi…☆34Jan 16, 2026Updated 3 months ago
- 新增一个CBF层,并将其结合进actor网络中,得到safe RL框架。后续验证中发现这种做法并没有实质性的用处,所以不再继续这个项目☆12Mar 14, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆146Mar 31, 2026Updated last month
- [NeurIPS 2021] World modelling and action learning using a contrastive formulation of the active inference framework, for reaching visual…☆15Jan 22, 2024Updated 2 years ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆64Apr 11, 2026Updated 3 weeks ago
- Benchmarking of Neural Network Architectures in Reinforcement Learning.☆38Jan 22, 2026Updated 3 months ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- [NeurIPS 2025 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆44Mar 28, 2026Updated last month