Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆21,906Updated this week
Alternatives and similar repositories for llamafile:
Users that are interested in llamafile are comparing it to the libraries listed below
- Port of OpenAI's Whisper model in C/C++☆38,321Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆26,223Updated this week
- LLM inference in C/C++☆76,180Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆18,548Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.☆131,836Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆20,441Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆82,156Updated this week
- the AI-native open-source embedding database☆18,468Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆24,283Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,756Updated last week
- tiny vision language model☆7,547Updated 2 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,492Updated 6 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.☆40,558Updated this week
- Structured Text Generation☆10,958Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,048Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆10,918Updated last week
- Python bindings for llama.cpp☆8,784Updated this week
- Tensor library for machine learning☆12,051Updated this week
- DSPy: The framework for programming—not prompting—language models☆22,362Updated this week
- Self-hosted AI coding assistant☆30,348Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,862Updated 6 months ago
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆42,790Updated last week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆19,904Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆39,727Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆30,895Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer☆27,947Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆15,168Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆34,064Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆22,514Updated this week
- structured outputs for llms☆9,706Updated this week