☆16Dec 23, 2021Updated 4 years ago
Alternatives and similar repositories for MonTTS
Users that are interested in MonTTS are comparing it to the libraries listed below
Sorting:
- ☆25Mar 12, 2022Updated 4 years ago
- ☆11May 7, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- ☆15May 8, 2021Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- calculate bhattacharyya distance based on zero cross rate feature between different Gaussian model for speech emotion recognition. corpus…☆11Oct 17, 2018Updated 7 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- FunAudioLLM homepage☆17Dec 11, 2024Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Pytorch-Named-Entity-Recognition-with-BERT☆15Oct 31, 2020Updated 5 years ago
- ☆12Nov 5, 2019Updated 6 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- ☆20Feb 4, 2024Updated 2 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆24Sep 1, 2023Updated 2 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆11May 9, 2023Updated 2 years ago
- ☆49May 3, 2020Updated 5 years ago
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆32Dec 8, 2022Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Apr 29, 2022Updated 3 years ago
- ☆33Nov 29, 2022Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- ALBERT trained on Mongolian text corpus☆18Jan 10, 2021Updated 5 years ago