JackZeng0208 / llama.cpp-android-tutorial
llama.cpp tutorial on Android phone
☆68Updated last month
Related projects: ⓘ
- ☆194Updated this week
- A mobile Implementation of llama.cpp☆281Updated 7 months ago
- Local LLM App☆114Updated this week
- automatically quant GGUF models☆119Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated last month
- A mobile Implementation of llama.cpp☆25Updated 11 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3☆33Updated last week
- Low-Rank adapter extraction for fine-tuned transformers model☆154Updated 4 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally)☆64Updated this week
- An unsupervised model merging algorithm for Transformers-based language models.☆96Updated 4 months ago
- Llama cute voice assistant☆28Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆84Updated 2 months ago
- A set of bash scripts to automate deployment of GGML/GGUF models [default: RWKV] with the use of KoboldCpp on Android - Termux☆38Updated 2 months ago
- A simple light terminal style chat app that lets you use connect to your local llama.cpp server☆27Updated 2 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆157Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated 10 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆39Updated 2 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆77Updated last month
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆159Updated this week
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆25Updated 3 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated 11 months ago
- run ollama & gguf easily with a single command☆46Updated 4 months ago
- A Ollama client for Android!☆78Updated 4 months ago
- A pipeline parallel training script for LLMs.☆79Updated 3 weeks ago
- ☆14Updated 3 months ago
- maid_llm is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆42Updated 3 weeks ago
- ☆20Updated 10 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆51Updated this week
- ☆144Updated 2 months ago
- ☆50Updated 3 months ago