Distribute and run LLMs with a single file.
☆23,755Mar 2, 2026Updated this week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- LLM inference in C/C++☆96,322Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆163,632Updated this week
- Port of OpenAI's Whisper model in C/C++☆47,067Updated this week
- build-once run-anywhere c library☆20,577Jan 25, 2026Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆40,672Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g…☆43,070Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆53,029Updated this week
- aider is AI pair programming in your terminal☆41,062Feb 25, 2026Updated last week
- Tensor library for machine learning☆14,152Updated this week
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- Universal LLM Deployment Engine with ML Compilation☆22,082Updated this week
- High-performance In-browser LLM Inference Engine☆17,456Feb 18, 2026Updated last week
- A natural language interface for computers☆62,427Feb 9, 2026Updated 3 weeks ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,171May 27, 2025Updated 9 months ago
- Inference Llama 2 in one file of pure C☆19,213Aug 6, 2024Updated last year
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆125,513Updated this week
- The definitive Web UI for local AI, with powerful features and easy setup.☆46,091Feb 3, 2026Updated last month
- DSPy: The framework for programming—not prompting—language models☆32,381Feb 24, 2026Updated last week
- Self-hosted AI coding assistant☆32,939Feb 24, 2026Updated last week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆55,217Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆57,143Updated this week
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI☆31,532Updated this week
- 🙌 OpenHands: AI-Driven Development☆68,459Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,340Feb 24, 2026Updated last week
- LLM training in simple, raw C/CUDA☆28,993Jun 26, 2025Updated 8 months ago
- Fast, flexible LLM inference☆6,623Updated this week
- Run frontier AI locally.☆41,955Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Updated this week
- Structured Outputs☆13,488Updated this week
- Universal memory layer for AI Agents☆47,994Feb 23, 2026Updated last week
- MLX: An array framework for Apple silicon☆24,066Updated this week
- A vector search SQLite extension that runs anywhere!☆7,041Feb 13, 2026Updated 2 weeks ago
- Perplexica is an AI-powered answering engine.☆29,068Feb 13, 2026Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆54,071Nov 12, 2025Updated 3 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆32,069Updated this week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,102Updated this week