Bip-Rep / sherpaLinks
A mobile Implementation of llama.cpp
☆312Updated last year
Alternatives and similar repositories for sherpa
Users that are interested in sherpa are comparing it to the libraries listed below
Sorting:
- AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.☆288Updated last year
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆216Updated 4 months ago
- A mobile Implementation of llama.cpp☆25Updated last year
- lcpp is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)☆88Updated this week
- Making offline AI models accessible to all types of edge devices.☆142Updated last year
- dart binding for llama.cpp☆232Updated 3 months ago
- llama.cpp tutorial on Android phone☆110Updated last month
- llama.cpp for Flutter☆174Updated last week
- ☆247Updated last month
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆186Updated 10 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆826Updated 7 months ago
- Local ML voice chat using high-end models.☆172Updated 2 weeks ago
- C++ implementation for 💫StarCoder☆453Updated last year
- Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)☆263Updated 2 years ago
- On-device LLM Inference Powered by X-Bit Quantization☆249Updated 2 weeks ago
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆236Updated 10 months ago
- Inference Llama 2 in one file of pure C☆42Updated last year
- A Ollama client for Android!☆86Updated last year
- An AI assistant beyond the chat box.☆328Updated last year
- Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models.☆77Updated 2 months ago
- Offline voice input panel & keyboard with punctuation for Android.☆105Updated last year
- LLaVA server (llama.cpp).☆180Updated last year
- Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.☆2,046Updated 3 weeks ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆126Updated 2 years ago
- automatically quant GGUF models☆184Updated last week
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆409Updated 2 years ago
- ggml implementation of BERT☆493Updated last year
- Awesome Mobile LLMs☆204Updated 3 weeks ago
- ONNX runtime for Flutter.☆273Updated 2 weeks ago