ggml-org / llama.cppLinks
LLM inference in C/C++
☆90,838Updated this week
Alternatives and similar repositories for llama.cpp
Users that are interested in llama.cpp are comparing it to the libraries listed below
Sorting:
- Tensor library for machine learning☆13,648Updated 2 weeks ago
- Python bindings for llama.cpp☆9,800Updated 3 months ago
- Universal LLM Deployment Engine with ML Compilation☆21,691Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup.☆45,550Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆76,955Updated 6 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,290Updated 6 months ago
- Inference Llama 2 in one file of pure C☆18,995Updated last year
- Distribute and run LLMs with a single file.☆23,448Updated 2 weeks ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,497Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,067Updated last month
- Locally run an Instruction-Tuned Chat-Style LLM☆10,197Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++☆44,967Updated this week
- Inference code for Llama models☆58,968Updated 10 months ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆156,856Updated last week
- Open-source search and retrieval database for AI applications.☆24,734Updated this week
- The simplest way to run LLaMA on your local machine☆13,027Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,983Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆64,758Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆45,661Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,203Updated 3 weeks ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆117,009Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆39,865Updated this week
- Official inference library for Mistral models☆10,561Updated 2 weeks ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆49,033Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,242Updated last year
- High-performance In-browser LLM Inference Engine☆16,885Updated 2 weeks ago
- 🔊 Text-Prompted Generative Audio Model☆38,787Updated last year
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆56,880Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,000Updated this week
- Go ahead and axolotl questions☆10,911Updated this week