mozilla-ai / llamafile
Distribute and run LLMs with a single file.
☆23,448 · Updated 2 weeks ago
Alternatives and similar repositories for llamafile
Users who are interested in llamafile are comparing it to the libraries listed below.
- Tensor library for machine learning ☆13,681 · Updated 2 weeks ago
- LLM inference in C/C++ ☆90,838 · Updated last week
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆157,308 · Updated this week
- Self-hosted AI coding assistant ☆32,540 · Updated 2 weeks ago
- High-performance In-browser LLM Inference Engine ☆16,924 · Updated 2 weeks ago
- Python bindings for llama.cpp ☆9,800 · Updated 3 months ago
- Port of OpenAI's Whisper model in C/C++ ☆44,967 · Updated this week
- MLX: An array framework for Apple silicon ☆22,983 · Updated this week
- Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI ☆27,559 · Updated this week
- Blazingly fast LLM inference. ☆6,262 · Updated this week
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time. ☆19,465 · Updated last week
- A vector search SQLite extension that runs anywhere! ☆6,513 · Updated 10 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆32,679 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆64,758 · Updated this week
- Open-source search and retrieval database for AI applications. ☆24,734 · Updated this week
- ⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agent… ☆30,177 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆117,009 · Updated last week
- Interact with your documents using the power of GPT, 100% privately, no data leaks ☆56,880 · Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆32,131 · Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,290 · Updated 6 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆39,620 · Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ☆39,865 · Updated this week
- The definitive Web UI for local AI, with powerful features and easy setup. ☆45,550 · Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆11,987 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆49,033 · Updated this week
- Universal memory layer for AI Agents ☆43,920 · Updated last week
- Structured Outputs