JackZeng0208 / llama.cpp-android-tutorialLinks
llama.cpp tutorial on Android phone
☆120Updated 3 months ago
Alternatives and similar repositories for llama.cpp-android-tutorial
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
Sorting:
- ☆249Updated last month
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆226Updated 6 months ago
- A mobile Implementation of llama.cpp☆314Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆42Updated last year
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆62Updated last year
- ☆59Updated 8 months ago
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆252Updated last week
- React Native binding of llama.cpp☆35Updated 2 weeks ago
- 使用Android手机的CPU推理stable diffusion☆156Updated last year
- automatically quant GGUF models☆190Updated this week
- Instructions for installing Open Interpreter on your Android device.☆224Updated last year
- A mobile Implementation of llama.cpp☆25Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆158Updated 3 months ago
- Running any GGUF SLMs/LLMs locally, on-device in Android☆432Updated this week
- Inference Llama 2 in one file of pure C☆42Updated 2 years ago
- Demonstration of running a native LLM on Android device.☆161Updated this week
- MiniCPM on Android platform.☆636Updated 4 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 10 months ago
- ☆117Updated 8 months ago
- Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device☆37Updated 2 weeks ago
- High-speed and easy-use LLM serving framework for local deployment☆115Updated 4 months ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆66Updated 10 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆128Updated 2 years ago
- Docker compose to run vLLM on Windows☆98Updated last year
- A minimal Android demo app for Kokoro-TTS☆28Updated 6 months ago
- Locally running LLM with internet access☆96Updated last month
- [ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices☆653Updated 2 months ago
- stable-diffusion.cpp bindings for python☆56Updated last month
- OpenGPT 4o is a free alternative to OpenAI GPT 4o☆211Updated 9 months ago
- Train your own small bitnet model☆75Updated 9 months ago