JackZeng0208 / llama.cpp-android-tutorialLinks
llama.cpp tutorial on Android phone
☆134Updated 6 months ago
Alternatives and similar repositories for llama.cpp-android-tutorial
Users that are interested in llama.cpp-android-tutorial are comparing it to the libraries listed below
Sorting:
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆241Updated 9 months ago
- ☆259Updated 2 months ago
- A mobile Implementation of llama.cpp☆320Updated last year
- This is a simple shell script to install the alpaca llama 7B model on termux for Android phones. All credits goes to the original develop…☆64Updated 2 years ago
- 使用Android手机的CPU推理stable diffusion☆159Updated last year
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆42Updated last year
- Inference Llama 2 in one file of pure C☆43Updated 2 years ago
- A Ollama client for Android!☆86Updated last year
- automatically quant GGUF models☆214Updated last week
- React Native binding of llama.cpp☆42Updated last week
- ☆63Updated 11 months ago
- A minimal Android demo app for Kokoro-TTS☆32Updated 8 months ago
- A mobile Implementation of llama.cpp☆26Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- ☆124Updated 11 months ago
- Docker compose to run vLLM on Windows☆104Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆323Updated this week
- High-speed and easy-use LLM serving framework for local deployment☆130Updated 2 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated last year
- Train your own small bitnet model☆75Updated last year
- Run SD1.x/2.x/3.x, SDXL, and FLUX.1 on your phone device☆47Updated 3 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆264Updated 7 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆161Updated 6 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 9 months ago
- ☆51Updated 8 months ago
- Port of Facebook's LLaMA model in C/C++☆63Updated 6 months ago
- Running any GGUF SLMs/LLMs locally, on-device in Android☆546Updated last month
- Croco.Cpp is fork of KoboldCPP infering GGML/GGUF models on CPU/Cuda with KoboldAI's UI. It's powered partly by IK_LLama.cpp, and compati…☆152Updated last week
- ☆24Updated 9 months ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆17Updated last year