JackZeng0208 / llama.cpp-android-tutorial
llama.cpp tutorial on Android phone
☆86Updated 5 months ago
Alternatives and similar repositories for llama.cpp-android-tutorial:
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
- ☆216Updated 2 months ago
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆161Updated 2 weeks ago
- A mobile Implementation of llama.cpp☆299Updated 11 months ago
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆59Updated last year
- A mobile Implementation of llama.cpp☆25Updated last year
- automatically quant GGUF models☆151Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 5 months ago
- React Native binding of llama.cpp☆24Updated this week
- A Ollama client for Android!☆81Updated 8 months ago
- Docker compose to run vLLM on Windows☆53Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆41Updated 7 months ago
- Inference Llama 2 in one file of pure C☆42Updated last year
- 使用Android手机的CPU推理stable diffusion☆146Updated last year
- A fast batching API to serve LLM models☆177Updated 8 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated 3 months ago