Maximilian-Winter / llama-cpp-agentLinks
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
☆575Updated 4 months ago
Alternatives and similar repositories for llama-cpp-agent
Users that are interested in llama-cpp-agent are comparing it to the libraries listed below
Sorting:
- function calling-based LLM agents☆287Updated 9 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆600Updated 8 months ago
- ☆908Updated 10 months ago
- A fast batching API to serve LLM models☆183Updated last year
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆385Updated 2 months ago
- An AI assistant beyond the chat box.☆329Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆328Updated 2 weeks ago
- Web UI for ExLlamaV2☆503Updated 5 months ago
- Large-scale LLM inference engine☆1,471Updated this week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆156Updated last year
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆319Updated 4 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆310Updated last year
- The easiest, and fastest way to run AI-generated Python code safely☆325Updated 7 months ago
- Efficient visual programming for AI language models☆364Updated last month
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 11 months ago
- Querying local documents, powered by LLM☆612Updated last month
- Task-based Agentic Framework using StrictJSON as the core☆453Updated last week
- ☆204Updated last month
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆656Updated this week
- A tool for generating function arguments and choosing what function to call with local LLMs☆428Updated last year
- A python package for developing AI applications with local LLMs.☆150Updated 6 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆273Updated 3 weeks ago
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆639Updated 5 months ago
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- TheBloke's Dockerfiles☆305Updated last year
- Software to implement GoT with a weviate vectorized database☆671Updated 3 months ago
- ☆205Updated last year