ZackHodari / discrete_intonationView external linksLinks
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitted to Speech Prosody
☆17May 24, 2020Updated 5 years ago
Alternatives and similar repositories for discrete_intonation
Users that are interested in discrete_intonation are comparing it to the libraries listed below
Sorting:
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Analysis code for speech intonation project☆14Feb 19, 2019Updated 6 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- ☆25Mar 6, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- ☆25Apr 24, 2019Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Jun 11, 2024Updated last year
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆31Nov 7, 2018Updated 7 years ago
- ☆33Jun 29, 2023Updated 2 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- A Generative Adversarial Network for Shakuhachi Music☆14Jul 2, 2019Updated 6 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Jan 24, 2023Updated 3 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Oct 28, 2024Updated last year
- TransferTTS (Zero-Shot learning of VITS)☆100Sep 23, 2022Updated 3 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Nov 4, 2022Updated 3 years ago
- Language independent SSL-based Speaker Anonymization system☆19May 28, 2024Updated last year
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.☆15Nov 25, 2023Updated 2 years ago
- Python implementation of the SFC intonation model.☆18Nov 29, 2017Updated 8 years ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- Heteronym to Phoneme Parser☆19Nov 4, 2023Updated 2 years ago
- ☆18Dec 7, 2023Updated 2 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- ☆19May 2, 2024Updated last year