MCPToolBench++: MCP (Model Context Protocol) Tool Use Benchmark on AI Agent and Model Tool Use Ability
☆44Mar 17, 2026Updated 2 months ago
Alternatives and similar repositories for MCPToolBenchPP
Users that are interested in MCPToolBenchPP are comparing it to the libraries listed below.
- AAMAS 2026: MeCo: Enhancing LLM-Empowered Multi-Robot Collaboration via Similar Task Memoization☆31Feb 4, 2026Updated 3 months ago
- Official code for the INFOCOM 2020 paper "Guardian: Evaluating Trust in Online Social Networks with Graph Convolutional Networks."☆11Jun 16, 2021Updated 4 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆14Mar 15, 2025Updated last year
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 7 months ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 9 months ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Apr 30, 2024Updated 2 years ago
- ☆94Mar 20, 2025Updated last year
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- Preview Code for Continuum Paper☆77Apr 13, 2026Updated last month
- The KlicStudio MCP server is a connector based on the Model Context Protocol (MCP), designed to facilitate interactions with KlicStudio s…☆21Jul 30, 2025Updated 9 months ago
- Source code of paper "TrustGuard: GNN-based Robust and Explainable Trust Evaluation with Dynamicity Support"☆27Sep 14, 2024Updated last year
- OpenMediation SDK Server☆16Oct 4, 2022Updated 3 years ago
- ☆31Feb 27, 2025Updated last year
- ☆28Feb 11, 2026Updated 3 months ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆18Oct 4, 2025Updated 7 months ago
- Notes and work-in-progress for BPF-related research projects☆12Jan 10, 2025Updated last year
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆27Jul 3, 2025Updated 10 months ago
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆27May 20, 2025Updated last year
- LiveMCPBench is a benchmark for evaluating the ability of agents to navigate and utilize a large-scale MCP toolset. It provides a compreh…☆100Dec 18, 2025Updated 5 months ago
- FLoRA: A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems☆13Apr 12, 2026Updated last month
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆17May 9, 2025Updated last year
- To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models☆33May 21, 2025Updated last year
- ☆10Apr 20, 2025Updated last year
- ☆37Jun 9, 2025Updated 11 months ago
- decision-making processes of human drivers☆13Mar 28, 2024Updated 2 years ago
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated 2 months ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation☆12Feb 16, 2024Updated 2 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆18Mar 31, 2025Updated last year
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆91May 5, 2026Updated 3 weeks ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 5 months ago
- ☆40Oct 15, 2025Updated 7 months ago
- Example repo showcasing model training and deployment with distil claude cli skill☆55Jan 19, 2026Updated 4 months ago
- Project Page of Paper "Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning"☆29May 8, 2025Updated last year
- Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"☆27Mar 24, 2026Updated 2 months ago
- ☆30May 22, 2025Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 11 months ago
- Official implementation of "Life-Harness"☆70Updated this week
- General benchmarking apparatus for running multi-agent systems against benchmarks☆46Apr 13, 2026Updated last month