Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆22,317Updated 2 weeks ago
Alternatives and similar repositories for llamafile:
Users that are interested in llamafile are comparing it to the libraries listed below
- LLM inference in C/C++☆79,077Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆26,010Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆21,842Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46,456Updated this week
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆37,861Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆24,672Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆139,442Updated this week
- Tensor library for machine learning☆12,445Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,357Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆32,314Updated this week
- Port of OpenAI's Whisper model in C/C++☆39,706Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆92,548Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,463Updated last year
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,616Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,216Updated last week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,600Updated 7 months ago
- Go ahead and axolotl questions☆9,258Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,502Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,064Updated last week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆42,313Updated this week
- Python bindings for llama.cpp☆9,030Updated 3 weeks ago
- aider is AI pair programming in your terminal☆32,400Updated this week
- Official inference library for Mistral models☆10,195Updated last month
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆7,981Updated this week
- Inference code for CodeLlama models☆16,286Updated 8 months ago
- Universal LLM Deployment Engine with ML Compilation☆20,548Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆16,248Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆27,929Updated last month
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer☆28,764Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆13,829Updated this week