MultiagentBench/MARBLE

MultiagentBench / MARBLE

(ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.01935

☆54

Alternatives and similar repositories for MARBLE

Users who are interested in MARBLE are comparing it to the repositories listed below.

ulab-uiuc / MARBLE
(ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…
☆281 · Oct 27, 2025 · Updated 8 months ago
genglongling / REALM-Bench
REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems
☆44 · Updated this week
qizhangli / Gradient-based-Jailbreak-Attacks
Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs
☆12 · Nov 7, 2024 · Updated last year
wslong20 / G-safeguard
☆41 · Jun 28, 2025 · Updated last year
FredJiang0324 / Anatomy-of-Agentic-Memory
☆25 · Apr 8, 2026 · Updated 3 months ago
zhrli324 / Corba
☆18 · May 17, 2025 · Updated last year
utnslab / Medes
Deduplication over dis-aggregated memory for Serverless Computing
☆14 · Mar 21, 2022 · Updated 4 years ago
zoe-yyx / AgentNet
[NIPS2025] A decentralized, RAG-enhanced multi-agent framework for LLMs with dynamic task routing and agent evolution.
☆58 · Oct 2, 2025 · Updated 9 months ago
floriangroetschla / AgentsNet
☆37 · Jul 16, 2025 · Updated last year
yuzhu-cai / rSDE-Bench
☆37 · May 29, 2025 · Updated last year
SamuelGong / grad_attacks
Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.
☆14 · Mar 18, 2024 · Updated 2 years ago
niveck / LLMafia
Asynchronous LLM Agent playing games of Mafia against human players
☆23 · Nov 12, 2025 · Updated 8 months ago
pigeon-dove / FGLA
FGLA: Fast Generation-Based Gradient Leakage Attacks against Highly Compressed Gradients
☆15 · Mar 17, 2026 · Updated 4 months ago
CodeEval-Pro / CodeEval-Pro
[ACL'25 Findings] Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"
☆40 · Apr 7, 2025 · Updated last year
SalesforceAIResearch / swecomm
☆28 · Jun 2, 2026 · Updated last month
multi-agent-systems-failure-taxonomy / MAST
☆392 · Jul 23, 2025 · Updated 11 months ago
Terra-Flux / PolyRL
[NSDI'26] PolyRL is a reinforcement learning framework for LLMs that harvests spot instances on the cloud to reduce cost.
☆19 · Mar 30, 2026 · Updated 3 months ago
ChengshuaiZhao0 / The-Wolf-Within
☆13 · Updated this week
7tl7qns7ch / IPOT
Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)
☆14 · Jul 30, 2024 · Updated last year
nuster1128 / MemSim
The official repository for "MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants".
☆17 · Oct 10, 2024 · Updated last year
amirgroup-codes / AstroAgents
AstroAgents: Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data
☆15 · Apr 1, 2025 · Updated last year
PearLoveTana / DarkForest_Review
☆24 · May 27, 2026 · Updated last month
YusanX / pde-agent-bench
PDEAgentBench: An automated benchmark framework for evaluating Code Agents on optimizing scientific PDE solvers.
☆41 · Jul 13, 2026 · Updated last week
bingreeky / MaAS
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
☆279 · Nov 13, 2025 · Updated 8 months ago
ACMClassCourses / Arch2022-Notes
☆12 · Sep 12, 2023 · Updated 2 years ago
yanweiyue / masrouter
☆130 · Oct 29, 2025 · Updated 8 months ago
NanshineLoong / Self-Evolving-Benchmark
A framework for evolving and testing question-answering datasets with various models.
☆26 · Feb 28, 2024 · Updated 2 years ago
sisaman / GAP
GAP: Differentially Private Graph Neural Networks with Aggregation Perturbation (USENIX Security '23)
☆51 · Jul 3, 2023 · Updated 3 years ago
biasinrecsys / wsdm2021
WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web
☆11 · Mar 8, 2021 · Updated 5 years ago
MagicHub-io / CSASR_Challenge
☆11 · Sep 26, 2022 · Updated 3 years ago
smolavipour / CMI_Neural_Estimator
Conditional Mutual Information Neural Estimator
☆15 · Oct 23, 2020 · Updated 5 years ago
sarus-tech / dp-rag
A simple implementation of DP-RAG
☆18 · Mar 17, 2025 · Updated last year
yanweiyue / GDesigner
☆97 · Dec 5, 2024 · Updated last year
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28 · Mar 14, 2024 · Updated 2 years ago
AI45Lab / ReflectionBench
[ICML 2025] ReflectionBench: Evaluating Epistemic Agency in Large Language Models
☆21 · Jun 24, 2025 · Updated last year
ppetr / lockfree-userspace-rcu
Lock-free RCU (Read-Copy-Update) user-space library
☆13 · Jan 3, 2026 · Updated 6 months ago
MangoKiller / SimOAR_OAR
☆11 · Nov 8, 2023 · Updated 2 years ago
SproutNan / AI-Safety_SCAV
This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"
☆49 · Oct 13, 2025 · Updated 9 months ago
pcyyyy / BioBGT
Biologically Plausible Brain Graph Transformer
☆16 · Mar 6, 2025 · Updated last year