aws-samples / multiagent-collab-scenario-benchmarkLinks
Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.
☆11Updated 7 months ago
Alternatives and similar repositories for multiagent-collab-scenario-benchmark
Users that are interested in multiagent-collab-scenario-benchmark are comparing it to the libraries listed below
Sorting:
- ☆30Updated 5 months ago
- ☆20Updated last month
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated last month
- ☆17Updated 11 months ago
- ☆50Updated 2 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆58Updated 4 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆10Updated 7 months ago
- ☆21Updated 7 months ago
- Official Repo for CRMArena and CRMArena-Pro☆101Updated 3 weeks ago
- Reproducible Language Agent Research☆29Updated 3 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆27Updated this week
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆32Updated 2 years ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Updated 3 months ago
- ☆75Updated 10 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆54Updated 4 months ago
- Integrate Amazon Q Business APIs into custom applications using Identity-aware credentials.☆12Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆115Updated 8 months ago
- ☆62Updated last month
- [ACL 2025] Agentic Knowledgeable Self-awareness☆75Updated last month
- Advanced RAG patterns on Amazon SageMaker☆14Updated last year
- ☆17Updated this week
- Function calling using Amazon Bedrock with Anthropic Claude 3 foundation model☆32Updated 8 months ago
- Official code repository for Sketch-of-Thought (SoT)☆125Updated 2 months ago
- ☆13Updated last month
- ☆40Updated 7 months ago
- ☆25Updated last month
- Deep Research through Multi-Agents, using GraphRAG☆76Updated 8 months ago
- ☆126Updated 2 months ago
- The original Shared Recurrent Memory Transformer implementation☆27Updated last week
- ☆33Updated 2 months ago