A way to analyze tool call accuracy, structural correctness, and tool recall for LLMs. Uses native tool calling.
☆23 · Aug 23, 2025 · Updated 6 months ago
Alternatives and similar repositories for LLMToolCallingTester
Users that are interested in LLMToolCallingTester are comparing it to the libraries listed below
- BH hackathon ☆14 · Apr 4, 2024 · Updated last year
- Eval exercises for Roo Code. ☆76 · Jun 9, 2025 · Updated 9 months ago
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 · Nov 28, 2024 · Updated last year
- OpenSource deployment made easy ☆10 · Jun 13, 2015 · Updated 10 years ago
- HTML Tags for styling your page ☆26 · Dec 2, 2025 · Updated 3 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 10 months ago
- ☆19 · Dec 31, 2025 · Updated 2 months ago
- ☆11 · Dec 23, 2023 · Updated 2 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX ☆45 · Updated this week
- Native iOS app for OpenCode AI coding agent — built with Expo & React Native. Real-time streaming, QR pairing, Face ID, terminal access, … ☆27 · Jan 15, 2026 · Updated 2 months ago
- Craft and run Agents right from your phone ☆27 · Oct 14, 2025 · Updated 5 months ago
- ☆15 · Feb 23, 2026 · Updated 3 weeks ago
- The end of Screenshot 2023-12-20-21.11.59.png ☆15 · Dec 22, 2023 · Updated 2 years ago
- ☆15 · Feb 12, 2025 · Updated last year
- Implementation of ModernBERT in MLX ☆20 · Jan 7, 2026 · Updated 2 months ago
- a Nushell cross.stream extension to interact with LLMs and MCP servers ☆18 · Updated this week
- A web interface for humans to interact with Beads - the issue tracker made for agents https://github.com/steveyegge/beads ☆23 · Oct 16, 2025 · Updated 5 months ago
- Personal collection of pipes and filters I use for open-webui ☆26 · Mar 10, 2026 · Updated last week
- a web logging proxy for MCP client-server communication ☆28 · Aug 17, 2025 · Updated 7 months ago
- JavaScript AST interpreter for sandboxed execution ☆27 · Oct 5, 2025 · Updated 5 months ago
- UI Package to build macOS apps for Claude code ☆47 · Feb 4, 2026 · Updated last month
- ☆10 · May 2, 2025 · Updated 10 months ago
- ☆14 · Oct 24, 2024 · Updated last year
- Example AI chat UI built with Cloudflare Workers, Vercel AI SDK, and Shadcn ☆21 · Apr 29, 2025 · Updated 10 months ago
- Suomi Poikas team website ☆13 · Mar 1, 2026 · Updated 3 weeks ago
- Quickly get custom prompt contexts ☆14 · Jan 26, 2026 · Updated last month
- A CLI in Rust to generate synthetic data for MLX friendly training ☆25 · Jan 13, 2024 · Updated 2 years ago
- Moondream MCP Server in Python ☆44 · Jul 2, 2025 · Updated 8 months ago
- Bring OKLCH to Tailwind ☆18 · Mar 30, 2024 · Updated last year
- Interactive chat application that enables users to have conversations with any website's content using Groq's fast inference capabilities… ☆25 · Sep 25, 2025 · Updated 5 months ago
- olive-cli: a minimal llm-based operating system for engineers packaged as a terminal app. ☆20 · Jun 13, 2025 · Updated 9 months ago
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx ☆27 · Mar 12, 2026 · Updated last week
- A jQuery UI widget that enables monitoring, querying, or changing the scroll position of an element relative to a scrolling container ☆26 · Jun 6, 2014 · Updated 11 years ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg… ☆21 · May 4, 2025 · Updated 10 months ago
- The old version of https://internet.dev ☆22 · Jan 22, 2025 · Updated last year
- GLSL editor with real time preview ☆30 · Jul 28, 2025 · Updated 7 months ago
- rust + wasm for declarative particle simulations ☆13 · Jan 7, 2023 · Updated 3 years ago
- Cloud-based Dev-focused Context Engineering IDE, Better than Claude ☆24 · Mar 3, 2026 · Updated 2 weeks ago
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max). ☆25 · Dec 16, 2023 · Updated 2 years ago