JackZeng0208 / llama.cpp-android-tutorialLinks
llama.cpp tutorial on Android phone
☆144Updated 9 months ago
Alternatives and similar repositories for llama.cpp-android-tutorial
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
Sorting:
- ☆291Updated this week
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆266Updated last year
- A mobile Implementation of llama.cpp☆326Updated 2 years ago
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆64Updated 2 years ago
- ☆65Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆44Updated last year
- Inference Llama 2 in one file of pure C☆43Updated 2 years ago
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆369Updated this week
- React Native binding of llama.cpp☆45Updated last week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆170Updated 9 months ago
- Train your own small bitnet model☆77Updated last year
- A minimal Android demo app for Kokoro-TTS☆41Updated 11 months ago
- Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device☆88Updated 6 months ago
- Instructions for installing Open Interpreter on your Android device.☆238Updated last year
- A mobile Implementation of llama.cpp☆26Updated 2 years ago
- MiniCPM on Android platform.☆636Updated 10 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated last year
- automatically quant GGUF models☆219Updated last month
- Llama cute voice assistant☆27Updated 2 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- ☆128Updated last year
- llama.cpp fork used by GPT4All☆55Updated 11 months ago
- ☆23Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- Awesome Mobile LLMs☆301Updated 2 months ago
- [ICLR-2025-SLLM Spotlight 🔥]MobiLlama : Small Language Model tailored for edge devices☆668Updated 8 months ago
- Docker compose to run vLLM on Windows☆114Updated 2 years ago
- ☆22Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆172Updated last month