kyutai-labs / pocket-ttsLinks
A TTS that fits in your CPU (and pocket)
☆621Updated this week
Alternatives and similar repositories for pocket-tts
Users that are interested in pocket-tts are comparing it to the libraries listed below
Sorting:
- ☆382Updated 2 months ago
- A high quality and fast TTS repository☆461Updated 3 weeks ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆746Updated this week
- VLLM Port of the Chatterbox TTS model☆357Updated 2 months ago
- An open-source implementation of Whisper☆472Updated 2 months ago
- A highly compressive and high-quality neural audio codec for speech models.☆204Updated last week
- TTS model capable of streaming conversational audio in realtime.☆1,011Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆356Updated last week
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆225Updated 5 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆347Updated 9 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆306Updated 7 months ago
- ☆532Updated 3 months ago
- ☆635Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆258Updated 7 months ago
- Fast audio super resolution from 16khz to 48khz.☆177Updated last week
- ☆342Updated 4 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆95Updated last month
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆376Updated this week
- ☆461Updated last week
- A lightning fast audio upsampler.☆224Updated this week
- Unofficial WIP LoRa Finetuning repository for VibeVoice☆325Updated 3 months ago
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆255Updated 2 weeks ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆921Updated 3 weeks ago
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,200Updated last week
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆226Updated 2 weeks ago
- Open Audio Watermarking Tool☆447Updated 3 weeks ago
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆736Updated 2 months ago
- ☆245Updated 3 weeks ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆56Updated last month
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆231Updated last month