Maximilian-Winter / llama-cpp-agentLinks
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.
☆606Updated 8 months ago
Alternatives and similar repositories for llama-cpp-agent
Users that are interested in llama-cpp-agent are comparing it to the libraries listed below
Sorting:
- function calling-based LLM agents☆289Updated last year
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆620Updated last year
- A fast batching API to serve LLM models☆188Updated last year
- A multimodal, function calling powered LLM webui.☆216Updated last year
- An AI assistant beyond the chat box.☆326Updated last year
- Web UI for ExLlamaV2☆511Updated 9 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆190Updated last year
- ☆1,116Updated last year
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆315Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆413Updated 6 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆379Updated last week
- Large-scale LLM inference engine☆1,583Updated this week
- TheBloke's Dockerfiles☆307Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆341Updated 8 months ago
- Efficient visual programming for AI language models☆361Updated 5 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- Software to implement GoT with a weviate vectorized database☆678Updated 7 months ago
- ☆207Updated 2 months ago
- A tool for generating function arguments and choosing what function to call with local LLMs☆433Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆280Updated 4 months ago
- ☆163Updated 3 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆521Updated last year
- A python package for developing AI applications with local LLMs.☆151Updated 10 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆494Updated last year
- Self-evaluating interview for AI coders☆597Updated 4 months ago
- C++ implementation for 💫StarCoder☆455Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,051Updated last year