mozilla-ai / llamafile
Distribute and run LLMs with a single file.
☆23,258 Updated 4 months ago
Alternatives and similar repositories for llamafile
Users who are interested in llamafile are comparing it to the libraries listed below.
- LLM inference in C/C++ ☆88,512 Updated this week
- Blazingly fast LLM inference. ☆6,171 Updated this week
- A vector search SQLite extension that runs anywhere! ☆6,332 Updated 9 months ago
- Inference Llama 2 in one file of pure C ☆18,891 Updated last year
- Tensor library for machine learning ☆13,332 Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch… ☆5,953 Updated 6 months ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ☆36,008 Updated last week
- Universal LLM Deployment Engine with ML Compilation ☆21,527 Updated last week
- Python bindings for llama.cpp ☆9,678 Updated 2 months ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆76,829 Updated 5 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆26,934 Updated last week
- High-performance In-browser LLM Inference Engine ☆16,712 Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆44,056 Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆11,883 Updated this week
- ⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI ☆29,476 Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆38,350 Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup. ☆45,225 Updated last week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,826 Updated last year
- Open-source search and retrieval database for AI applications. ☆24,161 Updated this week
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,201 Updated 2 years ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆154,818 Updated this week
- Examples in the MLX framework ☆7,949 Updated 3 weeks ago
- MLX: An array framework for Apple silicon ☆22,587 Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,861 Updated last week
- Local AI API Platform ☆2,760 Updated 3 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆30,339 Updated this week
- Structured Outputs ☆12,739 Updated 2 weeks ago
- Ollama Python library ☆8,752 Updated 3 weeks ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆47,355 Updated last week
- Official inference library for Mistral models ☆10,521 Updated 7 months ago