Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output. It also works with models that are not fine-tuned for JSON output and function calls.
☆609 Updated 10 months ago
Alternatives and similar repositories for llama-cpp-agent
Users who are interested in llama-cpp-agent are comparing it to the libraries listed below.
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆624 Updated last year
- Function-calling-based LLM agents ☆289 Updated last year
- A fast batching API to serve LLM models ☆188 Updated last year
- A multimodal, function-calling-powered LLM web UI. ☆217 Updated last year
- Web UI for ExLlamaV2 ☆514 Updated 11 months ago
- An AI assistant beyond the chat box. ☆328 Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆390 Updated 3 weeks ago
- Convenience scripts to fine-tune (chat-)LLaMa3 and other models for any language ☆314 Updated last year
- ☆1,173 Updated 3 weeks ago
- Efficient visual programming for AI language models ☆361 Updated 8 months ago
- Large-scale LLM inference engine ☆1,613 Updated this week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆192 Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆165 Updated last year
- An OpenAI API-compatible API for chat with image input and questions about the images, aka multimodal. ☆267 Updated 10 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆434 Updated last month
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated last year
- A Python package for developing AI applications with local LLMs. ☆151 Updated last year
- A tool for generating function arguments and choosing what function to call with local LLMs ☆434 Updated last year
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆346 Updated 10 months ago
- A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp ☆366 Updated last year
- Self-evaluating interview for AI coders ☆598 Updated 6 months ago
- C++ implementation for 💫StarCoder ☆459 Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 Updated last year
- ☆210 Updated last week
- ☆165 Updated 5 months ago
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model) ☆324 Updated last year
- Create Custom LLMs ☆1,797 Updated 2 months ago
- ☆161 Updated 11 months ago
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away. ☆761 Updated 10 months ago
- TheBloke's Dockerfiles ☆308 Updated last year