Mozilla-Ocho / llamafile

Distribute and run LLMs with a single file.

☆22,793

Alternatives and similar repositories for llamafile

Users who are interested in llamafile are comparing it to the libraries listed below.

ggml-org / llama.cpp
LLM inference in C/C++
☆83,197 Updated this week
ggml-org / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆41,688 Updated this week
ggml-org / ggml
Tensor library for machine learning
☆12,859 Updated this week
exo-explore / exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
☆29,010 Updated 4 months ago
ollama / ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
☆147,162 Updated this week
TabbyML / tabby
Self-hosted AI coding assistant
☆31,775 Updated this week
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,005 Updated 2 weeks ago
EricLBuehler / mistral.rs
Blazingly fast LLM inference.
☆5,913 Updated this week
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆9,353 Updated this week
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆9,726 Updated 10 months ago
bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
☆11,591 Updated this week
asg017 / sqlite-vec
A vector search SQLite extension that runs anywhere!
☆5,900 Updated 5 months ago
oobabooga / text-generation-webui
LLM UI with advanced features, easy setup, and multiple backend support.
☆44,391 Updated this week
mistralai / mistral-inference
Official inference library for Mistral models
☆10,367 Updated 4 months ago
antimatter15 / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
☆10,223 Updated 2 years ago
BerriAI / litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…
☆25,705 Updated this week
karpathy / llama2.c
Inference Llama 2 in one file of pure C
☆18,566 Updated 11 months ago
mlc-ai / web-llm
High-performance In-browser LLM Inference Engine
☆16,003 Updated 2 months ago
nlpxucan / WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
☆9,432 Updated last month
dottxt-ai / outlines
Structured Outputs
☆12,120 Updated this week
SJTU-IPADS / PowerInfer
High-speed Large Language Model Serving for Local Deployment
☆8,236 Updated 5 months ago
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆52,682 Updated this week
open-webui / open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
☆103,120 Updated this week
ml-explore / mlx
MLX: An array framework for Apple silicon
☆21,609 Updated this week
meta-llama / codellama
Inference code for CodeLlama models
☆16,352 Updated 11 months ago
neuml / txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
☆11,230 Updated last week
menloresearch / jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
☆34,997 Updated this week
jart / cosmopolitan
build-once run-anywhere C library
☆19,566 Updated 2 months ago
ml-explore / mlx-examples
Examples in the MLX framework
☆7,666 Updated last month
Aider-AI / aider
aider is AI pair programming in your terminal
☆35,856 Updated this week