mozilla-ai / llamafile
Distribute and run LLMs with a single file.
☆23,402 Updated 2 weeks ago
Alternatives and similar repositories for llamafile
Users who are interested in llamafile are comparing it to the libraries listed below.
- LLM inference in C/C++ ☆90,119 Updated this week
- High-performance In-browser LLM Inference Engine ☆16,806 Updated 2 weeks ago
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time. ☆19,215 Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch… ☆5,958 Updated 6 months ago
- Python bindings for llama.cpp ☆9,735 Updated 3 months ago
- A vector search SQLite extension that runs anywhere! ☆6,412 Updated 9 months ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ☆38,530 Updated this week
- ⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agent… ☆29,855 Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆76,915 Updated 5 months ago
- Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. D… ☆11,966 Updated last month
- Go ahead and axolotl questions ☆10,798 Updated this week
- Tensor library for machine learning ☆13,575 Updated this week
- Official inference library for Mistral models ☆10,543 Updated 8 months ago
- Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI ☆27,249 Updated this week
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆156,090 Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆11,933 Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆63,144 Updated this week
- Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRoute… ☆31,750 Updated this week
- MLX: An array framework for Apple silicon ☆22,821 Updated this week
- Open-source vector similarity search for Postgres ☆18,382 Updated 3 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,838 Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,615 Updated this week
- Local AI API Platform ☆2,764 Updated 4 months ago
- Inference Llama 2 in one file of pure C ☆18,952 Updated last year
- OpenUI lets you describe UI using your imagination, then see it rendered live. ☆21,791 Updated last month
- Self-hosted AI coding assistant ☆32,435 Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆31,172 Updated this week
- Private & local AI personal knowledge management app for high entropy people. ☆8,381 Updated 6 months ago
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆39,319 Updated this week
- Access large language models from the command-line ☆10,258 Updated this week