☆320May 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for binary-mlc-llm-libs
Users who are interested in binary-mlc-llm-libs are comparing it to the libraries listed below.
- A mobile Implementation of llama.cpp☆26Oct 11, 2023Updated 2 years ago
- Universal LLM Deployment Engine with ML Compilation☆22,792May 11, 2026Updated last month
- ☆176May 11, 2026Updated last month
- A frontend for running models on mobile or connecting to your preferred API providers.☆2,476Jun 7, 2026Updated last week
- High-performance In-browser LLM Inference Engine☆18,147Jun 5, 2026Updated last week
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- ☆78May 17, 2026Updated 3 weeks ago
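- This version is no longer supported because language limitations blocked further feature development and caused performance and UI problems. To use a supported version, visit https://github.com/shijuhao/QingZhouCE; QingZhou CE (轻昼CE) offers a more polished UI, more practical features, and better performance.☆17Jul 8, 2025Updated 11 months ago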
- Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral a…☆2,543Apr 7, 2026Updated 2 months ago
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 11 months ago
- AI Assistant running within your browser.☆81Dec 3, 2024Updated last year
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- A mobile Implementation of llama.cpp☆328Feb 1, 2024Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat☆97Apr 8, 2024Updated 2 years ago
- A personal attempt, based on mlc-llm, to deploy and run large models on Android phones☆90Aug 5, 2024Updated last year
- Port of Facebook's LLaMA model in C/C++☆21Nov 6, 2023Updated 2 years ago
- MiniCPM on Android platform.☆638Mar 19, 2025Updated last year
- Making offline AI models accessible to all types of edge devices.☆146Feb 12, 2024Updated 2 years ago
- Transform an XML document into a tabular data set. Better than spreadsheets.☆10Jan 18, 2016Updated 10 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,719Mar 12, 2024Updated 2 years ago
- Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM().☆700Jun 27, 2023Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models☆157Nov 20, 2023Updated 2 years ago
- WebLLM Chrome Extension Starter Pack.☆12Aug 10, 2023Updated 2 years ago
- ✨ Your Custom Offline Role Play with LLM and Stable Diffusion on Mac and Linux (for now) 🧙‍♂️☆30Nov 21, 2023Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 3 years ago
- Developer documentation for EMF APIs☆14Updated this week
- A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen, and other LLM models, including new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 8 months ago
- Inner Self is an AI Dungeon mod that grants memory, goals, secrets, planning, and self-reflection capabilities to the characters living i…☆53Jan 20, 2026Updated 4 months ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆22Nov 10, 2025Updated 7 months ago
- A one-page WebUI integrating VITS inference, training, and output in Sherpa-Onnx format.☆12Feb 2, 2025Updated last year
- FastPy-RS is a high-performance Python library that provides optimized implementations of common functions using Rust.☆18Aug 19, 2025Updated 9 months ago
- A simple OS (currently) developed in C and Assembly☆10Jul 14, 2016Updated 9 years ago
- Repo for hosting LocalDiffusion ONNX models for SDAI Android app☆37Feb 25, 2024Updated 2 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆25Updated this week
- Auto-video maker handling many AIs☆11Mar 18, 2024Updated 2 years ago
- Personal repository of dictionaries I made for Aard2. Automated builds based on Actions are planned.☆19May 19, 2026Updated 3 weeks ago
- Generative AI web UI and server☆22May 23, 2023Updated 3 years ago