Mozilla-Ocho / llamafileLinks
Distribute and run LLMs with a single file.
β22,681Updated this week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- Run your own AI cluster at home with everyday devices π±π» π₯οΈββ28,775Updated 3 months ago
- LLM inference in C/C++β82,419Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagβ¦β24,658Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chβ¦β5,928Updated 2 months ago
- β© Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and otherβ¦β27,417Updated this week
- Access large language models from the command-lineβ8,743Updated last week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.β145,020Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β12,380Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AIβ22,921Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β46,746Updated this week
- A programming framework for agentic AI π€ PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autβ¦β46,610Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.β17,080Updated this week
- aider is AI pair programming in your terminalβ35,002Updated this week
- Official inference library for Mistral modelsβ10,320Updated 3 months ago
- Self-hosted AI coding assistantβ31,620Updated this week
- πΈ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloadingβ9,694Updated 9 months ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running onβ¦β33,633Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β25,720Updated last week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,423Updated 3 weeks ago
- the AI-native open-source embedding databaseβ20,790Updated this week
- LLM UI with advanced features, easy setup, and multiple backend support.β44,184Updated this week
- Blazingly fast LLM inference.β5,779Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.β41,413Updated this week
- Inference Llama 2 in one file of pure Cβ18,508Updated 10 months ago
- Open source codebase powering the HuggingChat appβ8,892Updated last week
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.β35,923Updated this week
- tiny vision language modelβ8,158Updated last week
- Tensor library for machine learningβ12,738Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β100,914Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β17,551Updated this week