On-device LLM Inference Powered by X-Bit Quantization
☆311 · Apr 29, 2026 · Updated last week
Alternatives and similar repositories for picollm
Users that are interested in picollm are comparing it to the libraries listed below.
- On-device streaming text-to-speech engine powered by deep learning ☆139 · Apr 17, 2026 · Updated 3 weeks ago
- On-device speaker recognition engine powered by deep learning ☆42 · Apr 28, 2026 · Updated last week
- Recipes for on-device voice AI and local LLM ☆109 · Updated this week
- LLM Compression Benchmark ☆22 · Apr 8, 2026 · Updated last month
- On-device voice activity detection (VAD) powered by deep learning ☆250 · Apr 17, 2026 · Updated 3 weeks ago
- Benchmark for Speech-to-Intent engines ☆18 · Mar 27, 2026 · Updated last month
- On-device speaker diarization powered by deep learning ☆71 · Updated this week
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆23 · Jan 5, 2026 · Updated 4 months ago
- On-device speech-to-text engine powered by deep learning ☆481 · Updated this week
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆31 · Sep 20, 2025 · Updated 7 months ago
- Speaker diarization benchmark framework ☆40 · Jan 8, 2026 · Updated 4 months ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆1,054 · Updated this week
- An Ollama client for Android! ☆90 · May 6, 2024 · Updated 2 years ago
- ☆34 · Nov 18, 2025 · Updated 5 months ago
- ☆28 · Nov 3, 2025 · Updated 6 months ago
- ☆22 · Aug 21, 2025 · Updated 8 months ago
- ☆36 · Jan 6, 2026 · Updated 4 months ago
- On-device Speech-to-Intent engine powered by deep learning ☆699 · Apr 17, 2026 · Updated 3 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆35 · May 1, 2026 · Updated last week
- On-device streaming speech-to-text engine powered by deep learning ☆662 · Apr 18, 2026 · Updated 3 weeks ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration. ☆45 · Oct 6, 2025 · Updated 7 months ago
- Browse, search, and visualize ONNX models. ☆34 · May 6, 2025 · Updated last year
- Face Recognition using RPI5 Hailo8L AI Accelerator KIT ☆20 · Aug 30, 2024 · Updated last year
- Yet another `llama.cpp` Rust wrapper ☆12 · Updated this week
- A fully autonomous agent that accesses the browser and performs tasks. ☆18 · Apr 25, 2025 · Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 · Jun 12, 2024 · Updated last year
- Awesome Mobile LLMs ☆339 · May 1, 2026 · Updated last week
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Fast Multimodal LLM on Mobile Devices ☆1,497 · Apr 30, 2026 · Updated last week
- Voice activity engine benchmark framework ☆21 · Jan 14, 2026 · Updated 3 months ago
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on… ☆77 · May 25, 2024 · Updated last year
- LLM Collaboration ☆12 · Aug 23, 2024 · Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 · May 5, 2024 · Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- ☆42 · May 3, 2026 · Updated last week
- Automatically quantize GGUF models ☆224 · Dec 23, 2025 · Updated 4 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- ☆13 · Sep 12, 2024 · Updated last year