mozilla-ai / llamafileLinks
Distribute and run LLMs with a single file.
☆23,704Updated this week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- Get up and running with Kimi-K2.5, GLM-4.7, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆162,082Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆35,429Updated this week
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,822Updated last week
- ⏩ Ship faster with Continuous AI. Open-source CLI that can be used in Headless mode to run async cloud agents or TUI mode as an in sync c…☆31,266Updated this week
- LLM inference in C/C++☆94,823Updated this week
- Port of OpenAI's Whisper model in C/C++☆46,518Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆40,345Updated this week
- Tensor library for machine learning☆13,923Updated this week
- Perplexica is an AI-powered answering engine.☆28,711Updated last month
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,930Updated last year
- Local AI API Platform☆2,762Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆69,622Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,078Updated 8 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆123,582Updated this week
- Fast, flexible LLM inference☆6,508Updated this week
- aider is AI pair programming in your terminal☆40,449Updated 3 weeks ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆46,841Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,891Updated last year
- Self-hosted AI coding assistant☆32,849Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,967Updated 2 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,392Updated 8 months ago
- Python bindings for llama.cpp☆9,958Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆31,582Updated this week
- Run frontier AI locally.☆41,347Updated this week
- Large Language Model Text Generation Inference☆10,757Updated last month
- Universal LLM Deployment Engine with ML Compilation☆22,012Updated this week
- A guidance language for controlling large language models.☆21,270Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,099Updated 2 weeks ago
- The open source codebase powering HuggingChat☆10,501Updated this week
- ☆8,809Updated 3 months ago