Mozilla-Ocho / llamafileLinks
Distribute and run LLMs with a single file.
☆22,510Updated 2 weeks ago
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- LLM inference in C/C++☆80,984Updated this week
- Port of OpenAI's Whisper model in C/C++☆40,358Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆48,531Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆26,540Updated this week
- the AI-native open-source embedding database☆20,090Updated this week
- Python bindings for llama.cpp☆9,168Updated 3 weeks ago
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆39,558Updated this week
- A guidance language for controlling large language models.☆20,238Updated last week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆142,322Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆41,989Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆23,409Updated this week
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆8,154Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆32,890Updated this week
- Tensor library for machine learning☆12,591Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,685Updated 3 weeks ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,670Updated last week
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆43,761Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆23,835Updated this week
- High-performance In-browser LLM Inference Engine☆15,547Updated 3 weeks ago
- aider is AI pair programming in your terminal☆33,519Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer☆29,277Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆22,098Updated this week
- DSPy: The framework for programming—not prompting—language models☆24,538Updated this week
- Open source codebase powering the HuggingChat app☆8,748Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆16,619Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,634Updated 8 months ago
- Self-hosted AI coding assistant☆31,253Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,508Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆55,927Updated 6 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆96,527Updated this week