JackZeng0208 / llama.cpp-android-tutorial
llama.cpp tutorial on Android phone
☆110Updated last month
Alternatives and similar repositories for llama.cpp-android-tutorial
Users who are interested in llama.cpp-android-tutorial are comparing it to the repositories listed below
- A mobile Implementation of llama.cpp☆312Updated last year
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆216Updated 4 months ago
- ☆247Updated last month
- Inference Llama 2 in one file of pure C☆42Updated last year
- This is a simple shell script to install the Alpaca LLaMA 7B model in Termux on Android phones. All credit goes to the original develop…☆62Updated last year
- ☆57Updated 7 months ago
- An Ollama client for Android!☆86Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆42Updated last year
- Port of Facebook's LLaMA model in C/C++☆96Updated 2 weeks ago
- Awesome Mobile LLMs☆204Updated 3 weeks ago
- automatically quant GGUF models☆184Updated last week
- A mobile Implementation of llama.cpp☆25Updated last year
- Port of Facebook's LLaMA model in C/C++☆52Updated last month
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆63Updated 9 months ago
- React Native binding of llama.cpp☆30Updated 3 weeks ago
- High-speed and easy-to-use LLM serving framework for local deployment☆112Updated 3 months ago
- stable-diffusion.cpp bindings for python☆53Updated 3 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- ☆21Updated last year
- run ollama & gguf easily with a single command☆51Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆216Updated 2 weeks ago
- Run Stable Diffusion inference on the CPU of an Android phone☆152Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- Run SD onnx model on termux☆21Updated 2 years ago
- Demonstration of running a native LLM on Android device.☆144Updated 2 weeks ago
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices☆180Updated 3 weeks ago
- entropix style sampling + GUI☆26Updated 7 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Updated 4 months ago
- ☆114Updated 7 months ago