gheyret / thuyg20_scripts
Script files of THUYG-20(A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University)
☆12Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for thuyg20_scripts
- The project for speech translation☆11Updated last year
- Chinese polyphone disambiguation for Text-to-Speech application☆29Updated 5 months ago
- A simple command line tool to calculate WER for ASR.☆13Updated last month
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆17Updated 3 months ago
- Mutiband version of HIFIGAN☆17Updated 4 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆12Updated 3 years ago
- ☆15Updated 2 years ago
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆15Updated this week
- Speech samples and code of BEdit-TTS☆32Updated last year
- Just another FastSpeech 2 but cleaner code :)☆25Updated 4 months ago
- End-to-End Speech Processing Toolkit☆11Updated this week
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- Chinese Mandarin Synthesis Corpus-Female/Emotional☆11Updated 3 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated last year
- source code of EfficientTTS 2☆12Updated 9 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆16Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆18Updated 3 months ago
- ☆11Updated last year
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆29Updated 10 months ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆18Updated 2 months ago
- Python Wrapper for RnNoise v0.2☆21Updated last month
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆9Updated 9 months ago
- The implementation of g2pL with a new open dataset.☆16Updated last year
- ☆11Updated 3 years ago
- Survey on speech generation work.☆12Updated last year
- ☆13Updated last year
- Keyword spotting and forced alignment in any language☆39Updated 4 months ago
- ☆13Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆19Updated last year