Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output. It also works with models that are not fine-tuned for JSON output or function calling.
☆467 Updated last month
Related projects:
- ☆640 Updated this week
- Function-calling-based LLM agents ☆268 Updated this week
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆459 Updated this week
- Building AI agents, atomically ☆344 Updated this week
- Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)! ☆816 Updated this week
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆223 Updated last week
- Optimizing inference proxy for LLMs ☆406 Updated this week
- A fast batching API to serve LLM models ☆172 Updated 4 months ago
- The code used to train and run inference with the ColPali architecture. ☆502 Updated this week
- A multimodal, function calling powered LLM webui. ☆204 Updated 3 months ago
- Web UI for ExLlamaV2 ☆420 Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆790 Updated last week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 Updated 4 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM. ☆388 Updated 3 weeks ago
- Efficient visual programming for AI language models ☆288 Updated last week
- Task-based Agentic Framework using StrictJSON as the core ☆334 Updated last week
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆287 Updated 3 months ago
- An AI assistant beyond the chat box. ☆314 Updated 6 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆260 Updated 3 months ago
- A toolkit to create an optimal production-ready RAG setup for your data ☆365 Updated this week
- LLM powered retrieval engine designed to process a ton of sources to collect a comprehensive list of entities. ☆310 Updated 4 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆229 Updated last month
- Automatically evaluate your LLMs in Google Colab ☆511 Updated 4 months ago
- Deterministic LLM Outputs for AI Applications and AI Agents ☆807 Updated last week
- An OAI compatible exllamav2 API that's both lightweight and fast ☆455 Updated this week
- Large-scale LLM inference engine ☆934 Updated this week
- A tool for generating function arguments and choosing what function to call with local LLMs ☆329 Updated 6 months ago
- High-performance retrieval engine for unstructured data ☆778 Updated this week
- An AGentic Intelligence Operating System ☆282 Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,097 Updated this week