Mozilla-Ocho / llamafileLinks
Distribute and run LLMs with a single file.
☆23,212Updated 3 months ago
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- Self-hosted AI coding assistant☆32,275Updated 3 weeks ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆154,407Updated this week
- LLM inference in C/C++☆87,889Updated this week
- Tensor library for machine learning☆13,302Updated last week
- ⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI☆29,373Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆112,626Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆60,385Updated this week
- Port of OpenAI's Whisper model in C/C++☆43,903Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,857Updated last week
- aider is AI pair programming in your terminal☆37,952Updated 2 weeks ago
- Blazingly fast LLM inference.☆6,149Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆30,069Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆50,001Updated last week
- Python bindings for llama.cpp☆9,658Updated 2 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆38,269Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆31,859Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆51,324Updated this week
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, o…☆8,841Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,949Updated 5 months ago
- Ollama Python library☆8,704Updated 2 weeks ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆35,896Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,775Updated last year
- Large Language Model Text Generation Inference☆10,580Updated last month
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,161Updated 4 months ago
- A vector search SQLite extension that runs anywhere!☆6,300Updated 8 months ago
- Access large language models from the command-line☆9,922Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆26,837Updated this week
- High-speed Large Language Model Serving for Local Deployment☆8,367Updated 2 months ago
- Universal LLM Deployment Engine with ML Compilation☆21,497Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆44,778Updated this week