Mozilla-Ocho / llamafileLinks
Distribute and run LLMs with a single file.
☆22,633Updated last month
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆49,721Updated this week
- LLM inference in C/C++☆81,984Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆24,159Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆33,219Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆143,881Updated this week
- the AI-native open-source embedding database☆20,571Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,731Updated 2 weeks ago
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆16,917Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,813Updated last week
- DSPy: The framework for programming—not prompting—language models☆25,466Updated this week
- Python bindings for llama.cpp☆9,237Updated last month
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆99,512Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,675Updated 9 months ago
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.☆34,513Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,911Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆26,931Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆73,654Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,641Updated last week
- Tensor library for machine learning☆12,697Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆42,374Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆45,908Updated this week
- Large Language Model Text Generation Inference☆10,236Updated this week
- aider is AI pair programming in your terminal☆34,480Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆15,276Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆46,091Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆40,545Updated last week
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆8,261Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆45,460Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,040Updated 11 months ago
- LLM UI with advanced features, easy setup, and multiple backend support.☆43,933Updated this week