A mobile Implementation of llama.cpp
☆327Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for sherpa
Users that are interested in sherpa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A mobile Implementation of llama.cpp☆26Oct 11, 2023Updated 2 years ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆18May 3, 2024Updated last year
- Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral a…☆2,448Apr 7, 2026Updated last week
- dart binding for llama.cpp☆288Jan 22, 2026Updated 2 months ago
- llama.cpp bindings for Flutter☆20Sep 9, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- lcpp is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆117Updated this week
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆274Feb 1, 2025Updated last year
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆64Sep 19, 2023Updated 2 years ago
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- llama.cpp tutorial on Android phone☆162Mar 21, 2026Updated 3 weeks ago
- llama.cpp for Flutter☆206Updated this week
- Android app for running transformers locally using LLama.cpp & Whisper.cpp☆30Jun 21, 2024Updated last year
- ☆13May 25, 2023Updated 2 years ago
- Universal LLM Deployment Engine with ML Compilation☆22,414Apr 6, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A flutter binding for llama.cpp, which use platform channel.☆36Mar 31, 2026Updated 2 weeks ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- Unreal Media Player to bring Live and VOD video streaming in HLS and DASH formats into your Unreal apps across multiple platforms.☆12Updated this week
- Fork of llama.cpp, extended for GPT-NeoX, RWKV-v4, and Falcon models☆28Jul 26, 2023Updated 2 years ago
- A basic startup guide on running LLMs on android locally or using an external ollama server☆36Jan 31, 2024Updated 2 years ago
- Demonstration of running a native LLM on Android device.☆243Updated this week
- Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)☆262Apr 5, 2023Updated 3 years ago
- ggml implementation of BERT☆500Feb 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Android JNI for port of Facebook's LLaMA model in C/C++☆26Jun 7, 2023Updated 2 years ago
- AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.☆310Apr 27, 2024Updated last year
- Android本地运行mnn-llm语言模型简单示例☆13Oct 2, 2025Updated 6 months ago
- Visual Studio Code on https://deta.space☆17Jan 24, 2024Updated 2 years ago
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation☆265Oct 31, 2023Updated 2 years ago
- Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the …☆16Oct 4, 2023Updated 2 years ago
- No-messing-around sh client for llama.cpp's server☆30Aug 7, 2024Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆42Aug 3, 2025Updated 8 months ago
- A rework of the gradio WebUI for the open-source unified multimodal model by ByteDance☆22Jun 3, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆161Aug 20, 2024Updated last year
- BlinkDL's RWKV-v4 running in the browser☆48Mar 2, 2023Updated 3 years ago
- This package provides Swift bindings for llama.cpp☆26Apr 4, 2023Updated 3 years ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,350Apr 15, 2024Updated last year
- ☆10May 7, 2022Updated 3 years ago
- MiniCPM on Android platform.☆641Mar 19, 2025Updated last year
- llama and other large language models on iOS and MacOS offline using GGML library.☆2,008Jan 30, 2026Updated 2 months ago