JackZeng0208 / llama.cpp-android-tutorialLinks
llama.cpp tutorial on Android phone
☆136Updated 6 months ago
Alternatives and similar repositories for llama.cpp-android-tutorial
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
Sorting:
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆243Updated 9 months ago
- ☆264Updated last week
- A mobile Implementation of llama.cpp☆322Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆43Updated last year
- automatically quant GGUF models☆214Updated last month
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆64Updated 2 years ago
- React Native binding of llama.cpp☆42Updated last month
- Inference Llama 2 in one file of pure C☆43Updated 2 years ago
- ☆64Updated last year
- A Ollama client for Android!☆88Updated last year
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆162Updated 7 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated last year
- A minimal Android demo app for Kokoro-TTS☆38Updated 9 months ago
- ☆125Updated last year
- Running any GGUF SLMs/LLMs locally, on-device in Android☆578Updated last week
- stable-diffusion.cpp bindings for python☆78Updated this week
- Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device☆59Updated 4 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated 10 months ago
- A mobile Implementation of llama.cpp☆26Updated 2 years ago
- ☆107Updated 3 months ago
- Train your own small bitnet model☆75Updated last year
- A pipeline parallel training script for LLMs.☆163Updated 7 months ago
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆127Updated last year
- 1.58-bit LLaMa model☆83Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Updated 8 months ago
- Port of Facebook's LLaMA model in C/C++☆64Updated 7 months ago
- Docker compose to run vLLM on Windows☆107Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆48Updated 2 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆749Updated this week