Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆20,538 Updated this week
Related projects
Alternatives and complementary repositories for llamafile
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆13,971 Updated this week
- LLM inference in C/C++ ☆68,097 Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆47,259 Updated this week
- Self-hosted AI coding assistant ☆21,897 Updated this week
- Tensor library for machine learning ☆11,233 Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ☆25,925 Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory. ☆12,838 Updated this week
- Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models. ☆98,420 Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more. ☆27,384 Updated this week
- Python bindings for llama.cpp ☆8,141 Updated this week
- ⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat… ☆19,306 Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆15,283 Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆35,738 Updated this week
- Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI. ☆15,471 Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch… ☆5,677 Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆30,423 Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆15,527 Updated this week
- Inference code for CodeLlama models ☆16,044 Updated 3 months ago
- aider is AI pair programming in your terminal ☆22,251 Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆21,327 Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆17,436 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆18,840 Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄. ☆11,989 Updated last week
- A programming framework for agentic AI 🤖 ☆34,482 Updated this week
- Inference Llama 2 in one file of pure C ☆17,476 Updated 3 months ago
- Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud. ☆10,079 Updated this week
- Automate browser-based workflows with LLMs and Computer Vision ☆10,475 Updated this week
- High-performance In-browser LLM Inference Engine ☆13,661 Updated this week
- tiny vision language model ☆5,760 Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM) ☆23,586 Updated this week