Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆21,262Updated last week
Alternatives and similar repositories for llamafile:
Users that are interested in llamafile are comparing it to the libraries listed below
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory☆20,611Updated this week
- LLM inference in C/C++☆70,826Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆16,235Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.☆30,389Updated this week
- Self-hosted AI coding assistant☆27,242Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer☆26,396Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆33,809Updated this week
- Tensor library for machine learning☆11,541Updated this week
- The Memory layer for your AI apps☆23,953Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆13,996Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,197Updated this week
- ⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat…☆21,432Updated this week
- aider is AI pair programming in your terminal☆24,970Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆28,480Updated this week
- Official inference library for Mistral models☆9,857Updated 2 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆18,680Updated this week
- A vector search SQLite extension that runs anywhere!☆4,670Updated last week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆18,641Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,349Updated 4 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,688Updated this week
- Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.☆107,852Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,783Updated 4 months ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,188Updated this week
- MLX: An array framework for Apple silicon☆18,334Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆13,023Updated 3 months ago
- Universal LLM Deployment Engine with ML Compilation☆19,630Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,057Updated this week
- A natural language interface for computers☆57,838Updated last month
- Blazingly fast LLM inference.☆4,826Updated this week
- Port of OpenAI's Whisper model in C/C++☆36,923Updated this week