JackZeng0208 / llama.cpp-android-tutorialLinks

llama.cpp tutorial on Android phone

☆120

Alternatives and similar repositories for llama.cpp-android-tutorial

Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below

Sorting:

mlc-ai / binary-mlc-llm-libs
☆249Updated last month
nerve-sparks / iris_android
IRIS is an android app for interfacing with GGUF / llama.cpp models locally.
☆226Updated 6 months ago
Bip-Rep / sherpa
A mobile Implementation of llama.cpp
☆314Updated last year
latestissue / AltaeraAI
A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux
☆42Updated last year
Tempaccnt / Termux-alpaca
This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…
☆62Updated last year
UbiquitousLearning / PhoneLM
☆59Updated 8 months ago
quic / ai-hub-apps
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…
☆252Updated last week
Vali-98 / cui-llama.rn
React Native binding of llama.cpp
☆35Updated 2 weeks ago
ZTMIDGO / Android-Stable-diffusion-ONNX
使用Android手机的CPU推理stable diffusion
☆156Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆190Updated this week
MikeBirdTech / open-interpreter-termux
Instructions for installing Open Interpreter on your Android device.
☆224Updated last year
dsd / sherpa
A mobile Implementation of llama.cpp
☆25Updated last year
akx / ggify
Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp
☆158Updated 3 months ago
shubham0204 / SmolChat-Android
Running any GGUF SLMs/LLMs locally, on-device in Android
☆432Updated this week
Manuel030 / llama2.c-android
Inference Llama 2 in one file of pure C
☆42Updated 2 years ago
DakeQQ / Native-LLM-for-Android
Demonstration of running a native LLM on Android device.
☆161Updated this week
Achazwl / mlc
MiniCPM on Android platform.
☆636Updated 4 months ago
menloresearch / cortex.tensorrt-llm
Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…
☆43Updated 10 months ago
kevkid / gguf_gui
☆117Updated 8 months ago
rmatif / Local-Diffusion
Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device
☆37Updated 2 weeks ago
powerserve-project / PowerServe
High-speed and easy-use LLM serving framework for local deployment
☆115Updated 4 months ago
saic-fi / MobileQuant
[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models
☆66Updated 10 months ago
nuance1979 / llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆128Updated 2 years ago
aneeshjoy / vllm-windows
Docker compose to run vLLM on Windows
☆98Updated last year
puff-dayo / Kokoro-82M-Android
A minimal Android demo app for Kokoro-TTS
☆28Updated 6 months ago
Rivridis / LLM-Assistant
Locally running LLM with internet access
☆96Updated last month
mbzuai-oryx / MobiLlama
[ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices
☆653Updated 2 months ago
william-murray1204 / stable-diffusion-cpp-python
stable-diffusion.cpp bindings for python
☆56Updated last month
KingNish24 / OpenGPT-4o
OpenGPT 4o is a free alternative to OpenAI GPT 4o
☆211Updated 9 months ago
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆75Updated 9 months ago