Manuel030 / llama2.c-android
Inference Llama 2 in one file of pure C
☆38Updated last year
Related projects: ⓘ
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆45Updated 3 months ago
- maid_llm is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆42Updated 3 weeks ago
- A mobile Implementation of llama.cpp☆281Updated 7 months ago
- llama.cpp tutorial on Android phone☆68Updated last month
- A mobile Implementation of llama.cpp☆25Updated 11 months ago
- Code for finetuning RedPajama-Chat-3B using LoRA☆13Updated last year
- Local LLM App☆114Updated this week
- ☆78Updated 8 months ago
- ☆56Updated 9 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆62Updated last year
- ☆194Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated 10 months ago
- DeepFloyd IF web UI☆27Updated last year
- ☆40Updated 5 months ago
- Eh, simple and works.☆27Updated 9 months ago
- Offline voice input panel & keyboard with punctuation for Android.☆84Updated 3 months ago
- ☆18Updated last year
- ☆45Updated 7 months ago
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated 10 months ago
- ☆29Updated 9 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Updated 8 months ago
- Llama cute voice assistant☆28Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 3 months ago
- ☆86Updated 3 weeks ago
- ☆78Updated 9 months ago
- A voice to text keyboard based on OpenAI Whisper Model.☆45Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆96Updated 4 months ago
- inference code for mixtral-8x7b-32kseqlen☆97Updated 9 months ago
- A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android☆38Updated last week
- Gradio based tool to run opensource LLM models directly from Huggingface☆84Updated 2 months ago