☆320May 19, 2026Updated last month
Alternatives and similar repositories for binary-mlc-llm-libs
Users who are interested in binary-mlc-llm-libs are comparing it to the libraries listed below.
- A mobile Implementation of llama.cpp☆26Oct 11, 2023Updated 2 years ago
- Universal LLM Deployment Engine with ML Compilation☆22,863May 11, 2026Updated last month
- ☆14Updated this week
- ☆176Jun 14, 2026Updated 2 weeks ago
- A frontend for running models on mobile or connecting to your preferred API providers.☆2,535Jun 24, 2026Updated last week
- xiaozhi-esp32 server for lightweight devices such as NAS, routers, and Raspberry Pi☆40May 7, 2026Updated last month
- High-performance In-browser LLM Inference Engine☆18,279Jun 9, 2026Updated 3 weeks ago
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- Serve local ML inference engines to web apps☆32Apr 9, 2024Updated 2 years ago
- Stable Diffusion AI client app for Android and iOS☆1,213Jun 26, 2026Updated last week
- ☆85Jun 18, 2026Updated 2 weeks ago
- Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral a…☆2,566Apr 7, 2026Updated 2 months ago
- A Next.js chat app to use Llama locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- Sentence Embedding as a Service☆15Jun 30, 2025Updated last year
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆44Jun 21, 2024Updated 2 years ago
- A mobile Implementation of llama.cpp☆327Feb 1, 2024Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat☆97Apr 8, 2024Updated 2 years ago
- MiniCPM on Android platform.☆639Mar 19, 2025Updated last year
- Making offline AI models accessible to all types of edge devices.☆146Feb 12, 2024Updated 2 years ago
- Eidos – A Self-Growing AI Agent with Long-Term Memory and Environmental Awareness☆23Jul 4, 2025Updated 11 months ago
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,719Mar 12, 2024Updated 2 years ago
- A class that allows you to view (almost) any object in AHK v2.☆32May 3, 2024Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models☆157Nov 20, 2023Updated 2 years ago
- A simple GUI for managing MCP servers, for easy toggle mcp servers.☆14Dec 8, 2024Updated last year
- A port of the Android tool getevent to Linux☆18Jun 21, 2018Updated 8 years ago
- WebLLM Chrome Extension Starter Pack.☆12Aug 10, 2023Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 3 years ago
- Fast Multimodal LLM on Mobile Devices☆1,552Jun 9, 2026Updated 3 weeks ago
- lcpp is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆117Updated this week
- GNO is the UNIX-like environment for the Apple IIgs☆12Jun 11, 2025Updated last year
- A one-page WebUI integrating VITS inference, training, and output in Sherpa-Onnx format.☆14Feb 2, 2025Updated last year
- Fast, async GeoTIFF and COG reader for Python☆52Jun 22, 2026Updated last week
- MLX version of z-image model☆63May 28, 2026Updated last month
- [DEPRECATED] Glue your types to GraphQL☆18Dec 6, 2015Updated 10 years ago
- Generative AI web UI and server☆22May 23, 2023Updated 3 years ago
- An Activist Grade Privacy & Security App that provides multi-layered defense by isolating data within encrypted sandbox, inaccessible to …☆71Jun 1, 2026Updated last month
- [DEPRECATED] gRPC service that renders webpage HTML using Chromeless☆24May 17, 2018Updated 8 years ago