voicedock / ttspiper
Piper based VoiceDock TTS implementation
☆11 Updated 2 years ago
Alternatives and similar repositories for ttspiper
Users that are interested in ttspiper are comparing it to the libraries listed below
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackers ☆167 Updated last year
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP… ☆42 Updated 11 months ago
- Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices ☆31 Updated 2 years ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆31 Updated last year
- LLaVA server (llama.cpp). ☆183 Updated 2 years ago
- ☆17 Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆76 Updated last year
- Joint speech-language model - respond directly to audio! ☆372 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆22 Updated last year
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts ☆193 Updated last year
- Binding to transformers in ggml ☆64 Updated this week
- AirLLM 70B inference with single 4GB GPU ☆17 Updated 7 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- SGLang is a fast serving framework for large language models and vision language models. ☆32 Updated 2 months ago
- ASR + diarization model server with speculative decoding ☆64 Updated last year
- A lightweight Python package for Automatic Speech Recognition using ONNX models ☆245 Updated last week
- Cross-platform audio recorder designed for real-time speech audio processing ☆128 Updated 2 weeks ago
- pure go for rwkv ☆19 Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆74 Updated 6 months ago
- A simple GUI utility for gathering LIMA-like chat data. ☆23 Updated 4 months ago
- ☆175 Updated 2 years ago
- Open TTS models, built for streaming on the edge ☆45 Updated 10 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 Updated 2 years ago
- Code for paper https://arxiv.org/abs/2501.00522 ☆14 Updated 9 months ago
- Pybind11 bindings for Whisper.cpp ☆63 Updated last week
- Curriculum training of instruction-following LLMs with Unsloth ☆14 Updated last month
- GGML implementation of BERT model with Python bindings and quantization. ☆58 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 Updated last year