Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆22,157Updated this week
Alternatives and similar repositories for llamafile:
Users that are interested in llamafile are comparing it to the libraries listed below
- LLM inference in C/C++☆77,955Updated this week
- Python bindings for llama.cpp☆8,943Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆31,682Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,355Updated this week
- Tensor library for machine learning☆12,272Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆44,418Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆25,324Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,558Updated 7 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆20,383Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,364Updated 2 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,948Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.☆136,553Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆40,761Updated this week
- A guidance language for controlling large language models.☆20,017Updated this week
- Port of OpenAI's Whisper model in C/C++☆39,055Updated last week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆15,854Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,098Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer☆28,381Updated this week
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆36,949Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,380Updated 11 months ago
- Inference Llama 2 in one file of pure C☆18,273Updated 8 months ago
- Inference code for CodeLlama models☆16,270Updated 8 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,365Updated 8 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆27,556Updated 3 weeks ago
- High-performance In-browser LLM Inference Engine☆15,177Updated 2 months ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆29,658Updated this week
- Structured Text Generation☆11,314Updated this week
- the AI-native open-source embedding database☆19,226Updated this week
- Official inference library for Mistral models☆10,167Updated 3 weeks ago
- A vector search SQLite extension that runs anywhere!☆5,404Updated 2 months ago