mozilla-ai / llamafileLinks
Distribute and run LLMs with a single file.
☆23,639Updated last week
Alternatives and similar repositories for llamafile
Users that are interested in llamafile are comparing it to the libraries listed below
Sorting:
- Tensor library for machine learning☆13,874Updated 2 weeks ago
- LLM inference in C/C++☆93,398Updated last week
- A vector search SQLite extension that runs anywhere!☆6,723Updated last year
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,064Updated this week
- Blazingly fast LLM inference.☆6,379Updated this week
- Self-hosted AI coding assistant☆32,782Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆31,071Updated last week
- ⏩ Ship faster with Continuous AI. Open-source CLI that can be used in Headless mode to run async cloud agents or TUI mode as an in sync c…☆31,040Updated this week
- Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI☆28,441Updated 2 weeks ago
- Get up and running with OpenAI GLM-4.7, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆160,492Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,965Updated last month
- Inference Llama 2 in one file of pure C☆19,137Updated last year
- aider is AI pair programming in your terminal☆40,085Updated last week
- Port of OpenAI's Whisper model in C/C++☆46,066Updated last week
- Access large language models from the command-line☆10,983Updated last week
- High-speed Large Language Model Serving for Local Deployment☆8,591Updated 5 months ago
- A massively parallel, high-level programming language☆19,145Updated 7 months ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆31,181Updated last week
- MLX: An array framework for Apple silicon☆23,568Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,705Updated 2 weeks ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆16,464Updated last week
- tiny vision language model☆9,260Updated 2 months ago
- Local AI API Platform☆2,761Updated 6 months ago
- Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.☆22,075Updated 3 months ago
- The simplest way to run LLaMA on your local machine☆13,005Updated last year
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆32,312Updated 3 weeks ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,880Updated last year
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆40,168Updated this week
- Go ahead and axolotl questions☆11,138Updated this week
- Python bindings for llama.cpp☆9,917Updated 5 months ago