The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
☆647Mar 9, 2026Updated 3 months ago
Alternatives and similar repositories for llama-cpp-agent
Users that are interested in llama-cpp-agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆38Apr 30, 2026Updated last month
- ☆32Dec 29, 2023Updated 2 years ago
- Locally running LLM with internet access☆96Jun 30, 2025Updated 11 months ago
- function calling-based LLM agents☆291Sep 16, 2024Updated last year
- Python bindings for llama.cpp☆10,446Updated this week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- Chat language model that can use tools and interpret the results☆1,596Dec 3, 2025Updated 6 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆143Jul 9, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆11Dec 3, 2024Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 3 months ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 5 months ago
- Harness LLMs with Multi-Agent Programming☆4,049Jun 15, 2026Updated 2 weeks ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆353Feb 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A guidance compatibility layer for llama-cpp-python☆37Sep 11, 2023Updated 2 years ago
- Create Custom LLMs☆1,857Apr 24, 2026Updated 2 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,886Jan 28, 2024Updated 2 years ago
- A multimodal, function calling powered LLM webui.☆213Sep 23, 2024Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,022Apr 4, 2026Updated 2 months ago
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- ☆1,397Dec 22, 2025Updated 6 months ago
- ☆136May 26, 2026Updated last month
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆78Dec 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,261Updated this week
- An AI assistant beyond the chat box.☆329Mar 11, 2024Updated 2 years ago
- ☆347Mar 5, 2026Updated 3 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Nov 27, 2023Updated 2 years ago
- ☆19Jun 5, 2023Updated 3 years ago
- A tool for generating function arguments and choosing what function to call with local LLMs☆437Mar 12, 2024Updated 2 years ago
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆817Jun 22, 2026Updated last week
- Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d,…☆1,607Jun 16, 2026Updated last week
- ☆38Mar 12, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A library for working with GBNF files☆30May 27, 2026Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆152Jan 7, 2026Updated 5 months ago
- Python package wrapping llama.cpp for on-device LLM inference☆106Apr 2, 2026Updated 2 months ago
- Tools for merging pretrained large language models.☆7,190Jun 17, 2026Updated last week
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,567Mar 4, 2026Updated 3 months ago
- Efficient visual programming for AI language models☆359May 13, 2025Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Jul 3, 2024Updated last year