MCPToolBench++ MCP Model Context Protocol Tool Use Benchmark on AI Agent and Model Tool Use Ability
☆45Mar 17, 2026Updated 3 months ago
Alternatives and similar repositories for MCPToolBenchPP
Users that are interested in MCPToolBenchPP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AAMAS 2026: MeCo: Enhancing LLM-Empowered Multi-Robot Collaboration via Similar Task Memoization☆32Jun 8, 2026Updated last week
- ☆27May 30, 2026Updated 2 weeks ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆16Mar 15, 2025Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 8 months ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆36Aug 10, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Apr 30, 2024Updated 2 years ago
- The KlicStudio MCP server is a connector based on the Model Context Protocol (MCP), designed to facilitate interactions with KlicStudio s…☆21Jul 30, 2025Updated 10 months ago
- Source code of paper "TrustGuard: GNN-based Robust and Explainable Trust Evaluation with Dynamicity Support"☆27Sep 14, 2024Updated last year
- OpenMediation SDK Server☆16Oct 4, 2022Updated 3 years ago
- ☆23Jul 10, 2025Updated 11 months ago
- ☆31Feb 27, 2025Updated last year
- ☆28Feb 11, 2026Updated 4 months ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆19Oct 4, 2025Updated 8 months ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆25Jan 10, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Notes and work-in-progress for BPF-related research projects☆12Jan 10, 2025Updated last year
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆29Jul 3, 2025Updated 11 months ago
- Python Wrapper for RnNoise v0.2☆77Jan 14, 2026Updated 5 months ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- LiveMCPBench is a benchmark for evaluating the ability of agents to navigate and utilize a large-scale MCP toolset. It provides a compreh…☆101Dec 18, 2025Updated 6 months ago
- Encountering 14 different Naive RAG fails and using KG to solve it☆25Dec 4, 2025Updated 6 months ago
- To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models☆33May 21, 2025Updated last year
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated 3 months ago
- decision-making processes of human drivers☆14Mar 28, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆18May 9, 2025Updated last year
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆20Mar 31, 2025Updated last year
- PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents☆30Mar 26, 2025Updated last year
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 3 years ago
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆96May 5, 2026Updated last month
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆40Dec 24, 2025Updated 5 months ago
- Implementation of TCP connection tracking in eBPF☆15May 9, 2024Updated 2 years ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- A simple code base for Gaussian Splatting research☆21Nov 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Thai News Dataset from Thai government website.☆22Oct 21, 2025Updated 7 months ago
- Official code of "The Automated but Risky Game: Modeling and Benchmarking Agent-to-Agent Negotiations and Transactions in Consumer Market…☆27Jun 9, 2026Updated last week
- Frontend for Talent, a talent acquisition web application☆10Jan 5, 2023Updated 3 years ago
- ☆30May 22, 2025Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated last year
- The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egy…☆18Mar 9, 2026Updated 3 months ago
- ☆20May 14, 2024Updated 2 years ago