supertone-inc / super-monotonic-align
☆123Updated last month
Related projects ⓘ
Alternatives and complementary repositories for super-monotonic-align
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆114Updated 4 months ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆133Updated last year
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆130Updated last year
- UTokyo-SaruLab MOS Prediction System☆83Updated this week
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆64Updated last month
- ICASSP 2023 Accepted☆190Updated 6 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- The open source code for SimpleSpeech series☆108Updated last month
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆121Updated 8 months ago
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆93Updated last month
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆122Updated 4 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆89Updated 5 months ago
- ☆62Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆143Updated last year
- The official implementation of EmoSphere-TTS☆80Updated 3 months ago
- Reference-aware automatic speech evaluation toolkit☆106Updated 8 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated 2 weeks ago
- All generative model in one for better TTS model☆66Updated 2 months ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆135Updated 6 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆89Updated last week
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆162Updated 6 months ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆117Updated last year
- ☆100Updated last month
- Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆102Updated last month
- UT-Sarulab MOS prediction system using SSL models☆183Updated 6 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆57Updated 3 weeks ago
- Train the next generation of TTS systems.☆160Updated last month
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆157Updated 3 months ago
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆67Updated 5 months ago
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…☆194Updated 3 months ago