Distribute and run LLMs with a single file.
☆23,859Mar 19, 2026Updated this week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM inference in C/C++☆98,911Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆165,557Updated this week
- build-once run-anywhere c library☆20,667Mar 6, 2026Updated 2 weeks ago
- Port of OpenAI's Whisper model in C/C++☆47,689Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆73,479Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆39,597Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆41,146Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.☆57,673Updated this week
- Tensor library for machine learning☆14,252Mar 16, 2026Updated last week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g…☆43,827Updated this week
- aider is AI pair programming in your terminal☆42,197Mar 17, 2026Updated last week
- A natural language interface for computers☆62,780Feb 9, 2026Updated last month
- LlamaIndex is the leading document agent and OCR platform☆47,753Updated this week
- Inference Llama 2 in one file of pure C☆19,302Aug 6, 2024Updated last year
- Universal LLM Deployment Engine with ML Compilation☆22,246Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,236May 27, 2025Updated 9 months ago
- High-performance In-browser LLM Inference Engine☆17,616Mar 13, 2026Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆128,321Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week
- The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.☆46,278Mar 17, 2026Updated last week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.☆56,545Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆57,191Feb 26, 2026Updated 3 weeks ago
- 🙌 OpenHands: AI-Driven Development☆69,594Updated this week
- Self-hosted AI coding assistant☆33,022Mar 2, 2026Updated 3 weeks ago
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI☆31,921Updated this week
- LLM training in simple, raw C/CUDA☆29,216Jun 26, 2025Updated 8 months ago
- Fast, flexible LLM inference☆6,713Mar 15, 2026Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,256Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,680Mar 16, 2026Updated last week
- Structured Outputs☆13,588Updated this week
- A vector search SQLite extension that runs anywhere!☆7,239Updated this week
- Run frontier AI locally.☆42,805Updated this week
- MLX: An array framework for Apple silicon☆24,597Updated this week
- Vane is an AI-powered answering engine.☆33,329Mar 10, 2026Updated 2 weeks ago
- Universal memory layer for AI Agents☆50,147Mar 17, 2026Updated last week
- A programming framework for agentic AI☆55,908Updated this week
- Go ahead and axolotl questions☆11,460Updated this week
- Access large language models from the command-line☆11,397Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆32,910Mar 10, 2026Updated 2 weeks ago