☆312Apr 23, 2026Updated last week
Alternatives and similar repositories for binary-mlc-llm-libs
Users that are interested in binary-mlc-llm-libs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A mobile Implementation of llama.cpp☆26Oct 11, 2023Updated 2 years ago
- Universal LLM Deployment Engine with ML Compilation☆22,557Apr 22, 2026Updated last week
- ☆16Apr 20, 2026Updated 2 weeks ago
- ☆175Apr 20, 2026Updated 2 weeks ago
- Simple frontend for LLMs built in react-native.☆2,368Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆58Mar 30, 2026Updated last month
- High-performance In-browser LLM Inference Engine☆17,858Apr 24, 2026Updated last week
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- Nintendo 64 demo written in Java.☆11Jan 11, 2023Updated 3 years ago
- Serve local ML inference engines to web apps☆31Apr 9, 2024Updated 2 years ago
- Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral a…☆2,451Apr 7, 2026Updated 3 weeks ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- LLM plugin for running models using MLC☆192Mar 30, 2024Updated 2 years ago
- An old–school 3d–shooter with cartoon graphics from creators of Gloomy Dungeons.☆21Nov 20, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AI Assistant running within your browser.☆82Dec 3, 2024Updated last year
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- Lightweight termux desktop environment using openbox and polybar for low-end android devices☆81Apr 4, 2026Updated last month
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆44Jun 21, 2024Updated last year
- A mobile Implementation of llama.cpp☆327Feb 1, 2024Updated 2 years ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆93Apr 8, 2024Updated 2 years ago
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆90Aug 5, 2024Updated last year
- MiniCPM on Android platform.☆640Mar 19, 2025Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Making offline AI models accessible to all types of edge devices.☆146Feb 12, 2024Updated 2 years ago
- Transform an XML document into a tabular data set. Better than spreadsheets.☆10Jan 18, 2016Updated 10 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,721Mar 12, 2024Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models☆155Nov 20, 2023Updated 2 years ago
- Quick offline Android voting app for Majority Judgment polls where the phone is shared amongst participants.☆25Updated this week
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 3 months ago
- ✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙♂️☆30Nov 21, 2023Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast Multimodal LLM on Mobile Devices☆1,484Updated this week
- Code auto-complete and type checking for AWS boto3 in your VSCode☆21Jun 7, 2024Updated last year
- Scraper for Chub.ai and JanitorAI.com☆35Jan 11, 2024Updated 2 years ago
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 5 months ago
- lcpp is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆118Updated this week
- GNO is the UNIX-like environment for the Apple IIgs☆12Jun 11, 2025Updated 10 months ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 6 months ago