The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
☆615Feb 17, 2025Updated last year
Alternatives and similar repositories for llama-cpp-agent
Users that are interested in llama-cpp-agent are comparing it to the libraries listed below
Sorting:
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆27Dec 13, 2025Updated 2 months ago
- ☆32Dec 29, 2023Updated 2 years ago
- Locally running LLM with internet access☆97Jun 30, 2025Updated 8 months ago
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated last month
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- function calling-based LLM agents☆289Sep 16, 2024Updated last year
- Python bindings for llama.cpp☆10,003Aug 15, 2025Updated 6 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Chat language model that can use tools and interpret the results☆1,591Dec 3, 2025Updated 2 months ago
- Inference of Mamba and Mamba2 models in pure C☆197Jan 22, 2026Updated last month
- ☆19Jun 5, 2023Updated 2 years ago
- ☆17Aug 28, 2025Updated 6 months ago
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆16Nov 12, 2024Updated last year
- Create Custom LLMs☆1,810Nov 8, 2025Updated 3 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,134Feb 9, 2026Updated 2 weeks ago
- Harness LLMs with Multi-Agent Programming☆3,909Updated this week
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,881Jan 28, 2024Updated 2 years ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆349Feb 28, 2025Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,992Aug 24, 2025Updated 6 months ago
- Syllabus for EDCT GE 2550☆16Oct 3, 2019Updated 6 years ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆141Jul 9, 2024Updated last year
- ☆134Dec 11, 2025Updated 2 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆112Jul 3, 2024Updated last year
- A multimodal, function calling powered LLM webui.☆216Sep 23, 2024Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆105Oct 31, 2024Updated last year
- ☆1,201Dec 22, 2025Updated 2 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆78Dec 17, 2024Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Jan 7, 2026Updated last month
- An instruction tuned large language model with extra support for poetry and verse generation☆25Jun 5, 2023Updated 2 years ago
- ☆23Apr 25, 2023Updated 2 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- ☆338Jul 28, 2025Updated 7 months ago
- SORTED: A curated collection of interesting ideas, tools, and resources in neuroscience, data management, and data science, all in the sp…☆27Aug 10, 2025Updated 6 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,444Dec 9, 2025Updated 2 months ago
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,467Updated this week
- Automated LLM novelist☆46Apr 11, 2024Updated last year