mozilla-ai/llamafile

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mozilla-ai/llamafile)

mozilla-ai / llamafile

Distribute and run LLMs with a single file.

☆23,755

Alternatives and similar repositories for llamafile

Users that are interested in llamafile are comparing it to the libraries listed below

Sorting:

ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆96,322Updated this week
ollama / ollama
View on GitHub
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆163,632Updated this week
ggml-org / whisper.cpp
View on GitHub
Port of OpenAI's Whisper model in C/C++
☆47,067Updated this week
jart / cosmopolitan
View on GitHub
build-once run-anywhere c library
☆20,577Jan 25, 2026Updated last month
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆71,234Updated this week
BerriAI / litellm
View on GitHub
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…
☆37,083Updated this week
janhq / jan
View on GitHub
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
☆40,672Updated this week
mudler / LocalAI
View on GitHub
The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g…
☆43,070Updated this week
unslothai / unsloth
View on GitHub
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
☆53,029Updated this week
Aider-AI / aider
View on GitHub
aider is AI pair programming in your terminal
☆41,062Feb 25, 2026Updated last week
ggml-org / ggml
View on GitHub
Tensor library for machine learning
☆14,152Updated this week
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆47,210Updated this week
mlc-ai / mlc-llm
View on GitHub
Universal LLM Deployment Engine with ML Compilation
☆22,082Updated this week
mlc-ai / web-llm
View on GitHub
High-performance In-browser LLM Inference Engine
☆17,456Feb 18, 2026Updated last week
openinterpreter / open-interpreter
View on GitHub
A natural language interface for computers
☆62,427Feb 9, 2026Updated 3 weeks ago
nomic-ai / gpt4all
View on GitHub
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,171May 27, 2025Updated 9 months ago
karpathy / llama2.c
View on GitHub
Inference Llama 2 in one file of pure C
☆19,213Aug 6, 2024Updated last year
open-webui / open-webui
View on GitHub
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
☆125,513Updated this week
oobabooga / text-generation-webui
View on GitHub
The definitive Web UI for local AI, with powerful features and easy setup.
☆46,091Feb 3, 2026Updated last month
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆32,381Feb 24, 2026Updated last week
TabbyML / tabby
View on GitHub
Self-hosted AI coding assistant
☆32,939Feb 24, 2026Updated last week
Mintplex-Labs / anything-llm
View on GitHub
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
☆55,217Updated this week
zylon-ai / private-gpt
View on GitHub
Interact with your documents using the power of GPT, 100% privately, no data leaks
☆57,143Updated this week
continuedev / continue
View on GitHub
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
☆31,532Updated this week
OpenHands / OpenHands
View on GitHub
🙌 OpenHands: AI-Driven Development
☆68,459Updated this week
letta-ai / letta
View on GitHub
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
☆21,340Feb 24, 2026Updated last week
karpathy / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆28,993Jun 26, 2025Updated 8 months ago
EricLBuehler / mistral.rs
View on GitHub
Fast, flexible LLM inference
☆6,623Updated this week
exo-explore / exo
View on GitHub
Run frontier AI locally.
☆41,955Updated this week
Lightning-AI / litgpt
View on GitHub
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,206Updated this week
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆13,488Updated this week
mem0ai / mem0
View on GitHub
Universal memory layer for AI Agents
☆47,994Feb 23, 2026Updated last week
ml-explore / mlx
View on GitHub
MLX: An array framework for Apple silicon
☆24,066Updated this week
asg017 / sqlite-vec
View on GitHub
A vector search SQLite extension that runs anywhere!
☆7,041Feb 13, 2026Updated 2 weeks ago
ItzCrazyKns / Perplexica
View on GitHub
Perplexica is an AI-powered answering engine.
☆29,068Feb 13, 2026Updated 2 weeks ago
karpathy / nanoGPT
View on GitHub
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆54,071Nov 12, 2025Updated 3 months ago
datalab-to / marker
View on GitHub
Convert PDF to markdown + JSON quickly with high accuracy
☆32,069Updated this week
agno-agi / agno
View on GitHub
Build, run, manage agentic software at scale.
☆38,276Updated this week
qdrant / qdrant
View on GitHub
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…
☆29,102Updated this week