A way to analyze tool call accuracy, structural correctness, and tool recall for LLMs. Uses native tool calling.
☆23 · Aug 23, 2025 · Updated 8 months ago
Alternatives and similar repositories for LLMToolCallingTester
Users that are interested in LLMToolCallingTester are comparing it to the libraries listed below.
- Eval exercises for Roo Code. ☆77 · Jun 9, 2025 · Updated 11 months ago
- ☆19 · Aug 23, 2025 · Updated 8 months ago
- Monorepo for Opencom - Open Source customer engagement platform. Self-host or use the hosted version. ☆38 · Apr 25, 2026 · Updated 3 weeks ago
- MLX binary vectors and associated algorithms. ☆14 · Mar 13, 2025 · Updated last year
- Craft and run Agents right from your phone ☆32 · Oct 14, 2025 · Updated 7 months ago
- ☆15 · Feb 23, 2026 · Updated 2 months ago
- GitHub Action to invoke the PI coding agent on issues and PRs via comment triggers ☆23 · May 4, 2026 · Updated 2 weeks ago
- The end of Screenshot 2023-12-20-21.11.59.png ☆15 · Dec 22, 2023 · Updated 2 years ago
- Implementation of ModernBERT in MLX ☆21 · Jan 7, 2026 · Updated 4 months ago
- ☆15 · Feb 12, 2025 · Updated last year
- ☆26 · Dec 9, 2025 · Updated 5 months ago
- ☆24 · Nov 19, 2024 · Updated last year
- Pi extension that redirects all file operations and commands to a remote host via SSH ☆30 · Jan 10, 2026 · Updated 4 months ago
- A web interface for humans to interact with Beads - the issue tracker made for agents https://github.com/steveyegge/beads ☆26 · Oct 16, 2025 · Updated 7 months ago
- Personal collection of pipes and filters I use for open-webui ☆27 · Apr 15, 2026 · Updated last month
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu… ☆29 · Jul 27, 2025 · Updated 9 months ago
- a web logging proxy for MCP client-server communication ☆29 · Aug 17, 2025 · Updated 9 months ago
- A full GTA San Andreas radio in the web. ☆11 · Mar 11, 2026 · Updated 2 months ago
- An ergonomic CSS framework. ☆14 · May 14, 2026 · Updated last week
- Entertainment while you wait for your llm to respond ☆12 · Feb 12, 2025 · Updated last year
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn ☆21 · Apr 29, 2025 · Updated last year
- ☆21 · Oct 9, 2024 · Updated last year
- Suomi Poikas team website ☆13 · Apr 28, 2026 · Updated 3 weeks ago
- Quickly get custom prompt contexts ☆14 · Mar 19, 2026 · Updated 2 months ago
- ☆12 · Apr 16, 2026 · Updated last month
- UI Package to build macOS apps for Claude code ☆49 · Feb 4, 2026 · Updated 3 months ago
- The official command line tool for Cosmic ☆16 · Mar 1, 2023 · Updated 3 years ago
- ☆22 · Dec 16, 2025 · Updated 5 months ago
- olive-cli: a minimal llm-based operating system for engineers packaged as a terminal app. ☆20 · Jun 13, 2025 · Updated 11 months ago
- JavaScript AST interpreter for sandboxed execution ☆33 · Oct 5, 2025 · Updated 7 months ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg… ☆21 · May 4, 2025 · Updated last year
- A jQuery UI widget that enables monitoring, querying, or changing the scroll position of an element relative to a scrolling container ☆26 · Jun 6, 2014 · Updated 11 years ago
- GLSL editor with real time preview ☆30 · Jul 28, 2025 · Updated 9 months ago
- The old version of https://internet.dev ☆23 · Jan 22, 2025 · Updated last year
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max). ☆25 · Dec 16, 2023 · Updated 2 years ago
- Cloud-based Dev-focused Context Engineering IDE, Better than Claude ☆24 · Mar 3, 2026 · Updated 2 months ago
- NOT T3.Chat (Clone of T3.Chat) ☆18 · Aug 29, 2025 · Updated 8 months ago
- color generation lib inspired by colorpalette.pro ☆66 · Dec 31, 2025 · Updated 4 months ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory ☆11 · Dec 8, 2022 · Updated 3 years ago