yuanhao-chen-nyoeghau / shanghainese-ttsLinks
Shanghainese TTS
☆26Updated 2 years ago
Alternatives and similar repositories for shanghainese-tts
Users that are interested in shanghainese-tts are comparing it to the libraries listed below
Sorting:
- Visual Speech Recongnition☆19Updated 10 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆21Updated 2 years ago
- ☆21Updated last year
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆51Updated 6 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Updated last year
- This is the experimental description of MnTTS2.☆11Updated last year
- ☆25Updated 3 years ago
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆37Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆38Updated 4 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Updated last year
- ☆14Updated 6 months ago
- Taiwanese Speech Synthesis with Tacotron2☆22Updated 3 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- ☆11Updated 2 years ago
- ☆55Updated 3 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆20Updated last year
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆78Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated 2 years ago
- ☆47Updated last year
- ☆14Updated 2 years ago
- This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...☆30Updated last month
- ☆14Updated last year
- ☆10Updated 2 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆25Updated 11 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆35Updated 3 years ago