yuanhao-chen-nyoeghau / shanghainese-ttsView external linksLinks
Shanghainese TTS
☆26Jul 30, 2023Updated 2 years ago
Alternatives and similar repositories for shanghainese-tts
Users that are interested in shanghainese-tts are comparing it to the libraries listed below
Sorting:
- ☆14Aug 19, 2024Updated last year
- Visual Speech Recongnition☆19Dec 24, 2024Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- text to speech☆10Mar 19, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆10Apr 17, 2024Updated last year
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆22Aug 14, 2025Updated 6 months ago
- ☆15Mar 31, 2025Updated 10 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated last week
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- A lightweight audio codec based on a single quantizer☆31Sep 4, 2025Updated 5 months ago
- 英単語から読みを推測するライブラリ。☆26Nov 8, 2025Updated 3 months ago
- ☆14Jul 24, 2025Updated 6 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated 2 weeks ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆19Aug 27, 2018Updated 7 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated last month
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- A family of efficient speech models for multilingual phone recognition☆42Updated this week
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 9 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- ☆32Aug 22, 2024Updated last year
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- Voice conversion with just linear regression.☆33Sep 25, 2025Updated 4 months ago
- ☆36Updated this week
- Pronounce Arabic words☆19May 27, 2019Updated 6 years ago