yuanhao-chen-nyoeghau / shanghainese-ttsLinks
Shanghainese TTS
☆26Updated 2 years ago
Alternatives and similar repositories for shanghainese-tts
Users that are interested in shanghainese-tts are comparing it to the libraries listed below
Sorting:
- ☆22Updated last year
- Visual Speech Recongnition☆19Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆53Updated 9 months ago
- ☆11Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- Voice conversion with just linear regression.☆32Updated 4 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 3 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Updated 3 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Updated last year
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- Extract phoneme-level timestamps from speeh audio.☆114Updated 2 weeks ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Updated 5 years ago
- ☆25Updated 3 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 3 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Updated last year
- English conversation corpus for conversational TTS.☆21Updated 2 years ago
- ☆47Updated last year
- Reimplementation of Miipher☆29Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Updated 10 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Updated 4 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 3 years ago
- ☆14Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21Updated 8 months ago
- Survey on speech generation work.☆21Updated 2 years ago
- ☆13Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆67Updated last year