mozilla-ai / llamafile
Distribute and run LLMs with a single file.
☆23,704Updated this week
Alternatives and similar repositories for llamafile
Users who are interested in llamafile are comparing it to the libraries listed below.
- Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.☆12,099Updated 2 weeks ago
- Port of OpenAI's Whisper model in C/C++☆46,518Updated this week
- A vector search SQLite extension that runs anywhere!☆6,858Updated last year
- LLM inference in C/C++☆94,823Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,024Updated 2 weeks ago
- Structured Outputs☆13,403Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,967Updated 2 months ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,136Updated 8 months ago
- Local AI API Platform☆2,762Updated 7 months ago
- Python bindings for llama.cpp☆9,971Updated 5 months ago
- Tensor library for machine learning☆13,923Updated this week
- aider is AI pair programming in your terminal☆40,449Updated 3 weeks ago
- MLX: An array framework for Apple silicon☆23,812Updated last week
- Access large language models from the command-line☆11,102Updated this week
- Fast, flexible LLM inference☆6,508Updated this week
- Run frontier AI locally.☆41,347Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,891Updated last year
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆40,345Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆35,429Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,402Updated 8 months ago
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath☆9,477Updated 8 months ago
- Private & local AI personal knowledge management app for high entropy people.☆8,507Updated 8 months ago
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- Get up and running with Kimi-K2.5, GLM-4.7, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆162,082Updated this week
- High-speed Large Language Model Serving for Local Deployment☆8,635Updated 2 weeks ago
- Fully private LLM chatbot that runs entirely in a browser with no server needed. Supports Mistral and Llama 3.☆2,679Updated last year
- ⏩ Ship faster with Continuous AI. Open-source CLI that can be used in Headless mode to run async cloud agents or TUI mode as an in sync c…☆31,266Updated last week
- Perplexica is an AI-powered answering engine.☆28,872Updated last month
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g…☆42,616Updated last week
- High-performance In-browser LLM Inference Engine☆17,258Updated this week