ekwek1 / soprano
Soprano: Instant, Ultra-Realistic Text-to-Speech
☆1,137 Updated 2 weeks ago
Alternatives and similar repositories for soprano
Users who are interested in soprano are comparing it to the libraries listed below.
- A high quality and fast TTS repository ☆486 Updated last month
- Make text LLMs listen and speak ☆1,152 Updated last week
- ☆385 Updated 3 months ago
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime. ☆632 Updated last week
- Run Orpheus 3B Locally With LM Studio ☆510 Updated 10 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆283 Updated 9 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆264 Updated 7 months ago
- VLLM Port of the Chatterbox TTS model ☆364 Updated 3 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆648 Updated 7 months ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork) ☆964 Updated last week
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor… ☆343 Updated 8 months ago
- TTS model capable of streaming conversational audio in realtime. ☆1,027 Updated 2 months ago
- Realtime demo, Streaming and Finetuning code for CSM ☆440 Updated 4 months ago
- A TTS that fits in your CPU (and pocket) ☆2,683 Updated last week
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX. ☆2,552 Updated 2 weeks ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model ☆156 Updated 3 weeks ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT ☆431 Updated 4 months ago
- G2P ☆400 Updated 5 months ago
- ☆637 Updated 2 months ago
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆342 Updated 4 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆388 Updated last week
- ☆536 Updated 4 months ago
- A random walk voice style cloning application for Kokoro text to speech ☆205 Updated 7 months ago
- Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP… ☆514 Updated last month
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆2,822 Updated last week
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages ☆2,620 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆219 Updated 9 months ago
- Interface for OuteTTS models. ☆1,421 Updated 7 months ago
- ☆346 Updated 5 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆347 Updated 9 months ago