JackZeng0208 / llama.cpp-android-tutorial
llama.cpp tutorial on Android phone
☆101Updated last week
Alternatives and similar repositories for llama.cpp-android-tutorial
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
Sorting:
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆203Updated 3 months ago
- A mobile Implementation of llama.cpp☆311Updated last year
- ☆241Updated last week
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆41Updated 10 months ago
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆62Updated last year
- A Ollama client for Android!☆84Updated last year
- A mobile Implementation of llama.cpp☆25Updated last year
- React Native binding of llama.cpp☆30Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 8 months ago
- automatically quant GGUF models☆175Updated this week
- 使用Android手机的CPU推理stable diffusion☆152Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 3 months ago
- Inference Llama 2 in one file of pure C☆42Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆189Updated this week
- stable-diffusion.cpp bindings for python☆50Updated 2 months ago
- Demonstration of running a native LLM on Android device.☆136Updated this week
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆17Updated last year
- ☆56Updated 5 months ago
- TinyClick: Single-Turn Agent for Empowering GUI Automation☆33Updated 6 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 7 months ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆56Updated 7 months ago
- ☆42Updated 3 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- 1.58-bit LLaMa model☆81Updated last year
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 6 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- ☆89Updated 4 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 7 months ago