SMART-TTS / SMART-Multi-Speaker-Style-TTS
Multi-speaker & Multi-style TTS
☆29Jul 3, 2024Updated last year
Alternatives and similar repositories for SMART-Multi-Speaker-Style-TTS
Users who are interested in SMART-Multi-Speaker-Style-TTS are comparing it to the libraries listed below
- ☆51Jul 6, 2023Updated 2 years ago
- ☆52Jan 6, 2022Updated 4 years ago
- ☆97Jul 6, 2023Updated 2 years ago
- ☆102Mar 24, 2023Updated 2 years ago
- ☆60Jan 6, 2022Updated 4 years ago
- ☆24Feb 14, 2025Updated last year
- Speaker designation module☆21Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- ☆35Feb 14, 2025Updated last year
- Deep Speech 2 for Korean speech recognition☆27Jul 14, 2020Updated 5 years ago
- ☆43Feb 14, 2025Updated last year
- Prosody-semantics Interface in Seoul Korean☆12Oct 9, 2020Updated 5 years ago
- Updated fork of g2pk☆13Aug 18, 2023Updated 2 years ago
- ☆18Nov 18, 2022Updated 3 years ago
- ☆31Nov 24, 2023Updated 2 years ago
- Cross attentive pooling for speaker verification (IEEE SLT, 2021)☆12Dec 14, 2020Updated 5 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- Trends, Tools, News timeline ...☆19Oct 13, 2025Updated 4 months ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆38Feb 17, 2025Updated 11 months ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- ☆48Feb 14, 2025Updated last year
- ☆15Nov 28, 2021Updated 4 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆129Apr 8, 2023Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- Korean Speech to English Translation Corpus☆45Sep 3, 2021Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 7 months ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆34Nov 18, 2025Updated 2 months ago
- ☆87Dec 21, 2022Updated 3 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- A trainer for SNAC (Multi-Scale Neural Audio Codec) with the decoder replaced by Vocos.☆64Oct 28, 2024Updated last year
- Implementation of Korean FastSpeech2☆215Jan 29, 2023Updated 3 years ago
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 3 years ago
- ☆32Dec 2, 2024Updated last year
- ☆21Apr 6, 2021Updated 4 years ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 5 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆115Jun 23, 2025Updated 7 months ago
- List of podcast feeds using the iTunes API and a script to download ~6,000,000 hours of English speech.☆31Apr 13, 2023Updated 2 years ago