Mozilla-Ocho / llamafile
Distribute and run LLMs with a single file.
☆21,718Updated 2 weeks ago
Alternatives and similar repositories for llamafile:
Users that are interested in llamafile are comparing it to the libraries listed below
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,500Updated this week
- LLM inference in C/C++☆74,390Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38,093Updated this week
- Tensor library for machine learning☆11,857Updated this week
- MLX: An array framework for Apple silicon☆19,097Updated this week
- Python bindings for llama.cpp☆8,636Updated 2 weeks ago
- Inference Llama 2 in one file of pure C☆18,027Updated 6 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆10,587Updated this week
- Inference code for CodeLlama models☆16,205Updated 6 months ago
- ⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat…☆23,330Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆14,495Updated this week
- Port of OpenAI's Whisper model in C/C++☆37,743Updated last week
- Structured Text Generation☆10,702Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,206Updated this week
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆42,500Updated this week
- Examples in the MLX framework☆6,939Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆30,412Updated this week
- High-speed Large Language Model Serving for Local Deployment☆8,101Updated 2 weeks ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,796Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,201Updated 9 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆19,878Updated this week
- DSPy: The framework for programming—not prompting—language models☆21,882Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,980Updated this week
- Universal LLM Deployment Engine with ML Compilation☆19,972Updated this week
- Official inference library for Mistral models☆9,982Updated 3 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,432Updated 5 months ago
- Large Language Model Text Generation Inference☆9,756Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,568Updated 2 weeks ago
- High-performance In-browser LLM Inference Engine☆14,669Updated 3 weeks ago
- Agno is a lightweight library for building multi-modal Agents☆18,930Updated this week