The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output, and it also works with models that are not fine-tuned for JSON output or function calling.
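The structured-output trick shared by llama-cpp-agent and several of the projects listed below is grammar-constrained decoding: a JSON Schema is compiled into a GBNF grammar that llama.cpp enforces at sampling time, which is why even models not fine-tuned for JSON stay on format. Below is a toy sketch of the schema-to-grammar step, covering only flat objects with string/number fields; it is an illustration of the idea, not the framework's actual converter.

```python
def schema_to_gbnf(schema: dict) -> str:
    """Compile a flat JSON Schema object into a minimal GBNF grammar.

    Simplified sketch: handles only top-level "string" and "number"
    properties, emitted as required keys in schema order.
    """
    type_rules = {
        "string": 'string ::= "\\"" [^"]* "\\""',
        "number": 'number ::= "-"? [0-9]+ ("." [0-9]+)?',
    }
    props = schema["properties"]
    # One quoted key literal followed by a rule reference per property.
    pairs = [f'"\\"{name}\\":" {spec["type"]}' for name, spec in props.items()]
    sep = ' "," '  # a literal comma between key-value pairs
    rules = ['root ::= "{" ' + sep.join(pairs) + ' "}"']
    for t in sorted({spec["type"] for spec in props.values()}):
        rules.append(type_rules[t])
    return "\n".join(rules)

grammar = schema_to_gbnf({
    "type": "object",
    "properties": {"name": {"type": "string"}, "age": {"type": "number"}},
})
print(grammar)
```

Feeding the resulting grammar string to llama.cpp (e.g. through llama-cpp-python's grammar support) restricts token sampling so the model can only emit JSON that matches the schema.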
☆622 · Mar 9, 2026 · Updated last week
Alternatives and similar repositories for llama-cpp-agent
Users interested in llama-cpp-agent are comparing it to the libraries listed below.
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs. ☆29 · Mar 15, 2026 · Updated last week
- ☆32 · Dec 29, 2023 · Updated 2 years ago
- Locally running LLM with internet access ☆97 · Jun 30, 2025 · Updated 8 months ago
- Function calling-based LLM agents ☆291 · Sep 16, 2024 · Updated last year
- Python bindings for llama.cpp ☆10,058 · Aug 15, 2025 · Updated 7 months ago
- Chat language model that can use tools and interpret the results ☆1,594 · Dec 3, 2025 · Updated 3 months ago
- A comprehensive survey of business use cases of AI that help companies thrive in the digital economy ☆13 · Oct 7, 2020 · Updated 5 years ago
- TypeScript generator that produces llama.cpp grammars directly from TypeScript interfaces ☆141 · Jul 9, 2024 · Updated last year
- Modified beam search with periodic restarts ☆12 · Sep 12, 2024 · Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C ☆199 · Updated this week
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices ☆10 · Dec 3, 2024 · Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users… ☆14 · Jan 2, 2026 · Updated 2 months ago
- Harness LLMs with multi-agent programming ☆3,932 · Updated this week
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 languages supported | OpenAI API compatible ☆350 · Feb 28, 2025 · Updated last year
- Create custom LLMs ☆1,820 · Nov 8, 2025 · Updated 4 months ago
- A guidance compatibility layer for llama-cpp-python ☆36 · Sep 11, 2023 · Updated 2 years ago
- Python bindings for the Transformer models implemented in C/C++ using the GGML library ☆1,883 · Jan 28, 2024 · Updated 2 years ago
- A multimodal, function-calling-powered LLM web UI ☆215 · Sep 23, 2024 · Updated last year
- Enforce the output format (JSON Schema, regex, etc.) of a language model ☆1,994 · Aug 24, 2025 · Updated 6 months ago
- CLI tool to quantize GGUF, GPTQ, AWQ, HQQ and EXL2 models ☆79 · Dec 17, 2024 · Updated last year
- ☆1,215 · Dec 22, 2025 · Updated 2 months ago
- Simple agent framework using Ollama tool calling ☆10 · Aug 27, 2024 · Updated last year
- ☆134 · Mar 14, 2026 · Updated last week
- The official API server for Exllama. OAI-compatible, lightweight, and fast ☆1,154 · Mar 13, 2026 · Updated last week
- ☆337 · Mar 5, 2026 · Updated 2 weeks ago
- An AI assistant beyond the chat box ☆330 · Mar 11, 2024 · Updated 2 years ago
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R… ☆1,483 · Updated this week
- Converts JSON Schema to GBNF grammar for use with llama.cpp ☆55 · Nov 27, 2023 · Updated 2 years ago
- A tool for generating function arguments and choosing which function to call with local LLMs ☆439 · Mar 12, 2024 · Updated 2 years ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 · Jan 7, 2026 · Updated 2 months ago
- ☆19 · Jun 5, 2023 · Updated 2 years ago
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre… ☆806 · Feb 9, 2026 · Updated last month
- Python package wrapping llama.cpp for on-device LLM inference ☆101 · Oct 12, 2025 · Updated 5 months ago
- A simple experiment letting two local LLMs have a conversation about anything! ☆112 · Jul 3, 2024 · Updated last year
- ☆38 · Mar 12, 2024 · Updated 2 years ago
- entropix-style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- A library for working with GBNF files ☆29 · Nov 2, 2025 · Updated 4 months ago
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU) ☆839 · Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,468 · Mar 4, 2026 · Updated 2 weeks ago