Bip-Rep / sherpa
A mobile Implementation of llama.cpp
☆294 Updated 9 months ago
Related projects
Alternatives and complementary repositories for sherpa
- A mobile Implementation of llama.cpp ☆25 Updated last year
- Local LLM App ☆134 Updated last month
- Making offline AI models accessible to all types of edge devices. ☆127 Updated 9 months ago
- dart binding for llama.cpp ☆168 Updated 4 months ago
- llama.cpp tutorial on Android phone ☆77 Updated 3 months ago
- Falcon LLM ggml framework with CPU and GPU support ☆245 Updated 10 months ago
- ☆209 Updated 2 weeks ago
- maid_llm is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid) ☆48 Updated 2 months ago
- AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app. ☆226 Updated 6 months ago
- C++ implementation for 💫StarCoder ☆447 Updated last year
- Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. ☆1,487 Updated 2 weeks ago
- Simple frontend for LLMs built in react-native. ☆578 Updated this week
- An AI assistant beyond the chat box. ☆315 Updated 8 months ago
- llama.cpp for Flutter ☆99 Updated 3 weeks ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆145 Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆733 Updated last week
- llama.cpp with the BakLLaVA model describes what it sees ☆381 Updated last year
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo ☆173 Updated this week
- A fast batching API to serve LLM models ☆172 Updated 6 months ago
- ggml implementation of BERT ☆467 Updated 9 months ago
- Offline voice input panel & keyboard with punctuation for Android. ☆90 Updated 5 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆114 Updated last year
- automatically quant GGUF models ☆140 Updated this week
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 2 months ago
- WebAssembly binding for llama.cpp - Enabling in-browser LLM inference ☆441 Updated this week
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- On-device LLM Inference Powered by X-Bit Quantization ☆190 Updated last week
- Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models. ☆73 Updated 4 months ago
- LLM-based code completion engine ☆175 Updated last year