Distribute and run LLMs with a single file.
☆24,121Apr 11, 2026Updated this week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM inference in C/C++☆103,237Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆168,287Updated this week
- build-once run-anywhere c library☆20,743Mar 6, 2026Updated last month
- Port of OpenAI's Whisper model in C/C++☆48,405Mar 29, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆75,637Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆41,658Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆42,652Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.☆61,312Updated this week
- Tensor library for machine learning☆14,394Updated this week
- LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.☆44,986Apr 6, 2026Updated last week
- aider is AI pair programming in your terminal☆43,145Updated this week
- A natural language interface for computers☆63,040Feb 9, 2026Updated 2 months ago
- LlamaIndex is the leading document agent and OCR platform☆48,389Updated this week
- Inference Llama 2 in one file of pure C☆19,379Aug 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Universal LLM Deployment Engine with ML Compilation☆22,414Apr 6, 2026Updated last week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,328May 27, 2025Updated 10 months ago
- High-performance In-browser LLM Inference Engine☆17,740Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆131,509Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,649Updated this week
- The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.☆46,421Apr 7, 2026Updated last week
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.☆58,070Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆57,201Feb 26, 2026Updated last month
- 🙌 OpenHands: AI-Driven Development☆71,108Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Self-hosted AI coding assistant☆33,311Mar 2, 2026Updated last month
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI☆32,365Updated this week
- LLM training in simple, raw C/CUDA☆29,511Jun 26, 2025Updated 9 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,297Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,988Updated this week
- Fast, flexible LLM inference☆6,928Updated this week
- Structured Outputs☆13,657Mar 26, 2026Updated 2 weeks ago
- A vector search SQLite extension that runs anywhere!☆7,395Updated this week
- Run frontier AI locally.☆43,503Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MLX: An array framework for Apple silicon☆25,200Updated this week
- Vane is an AI-powered answering engine.☆33,727Updated this week
- Universal memory layer for AI Agents☆52,137Apr 6, 2026Updated last week
- A programming framework for agentic AI☆56,900Apr 6, 2026Updated last week
- Go ahead and axolotl questions☆11,608Updated this week
- Access large language models from the command-line☆11,592Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆33,701Updated this week