gheyret / thuyg20_scripts
Script files for THUYG-20 (a free Uyghur speech database released by CSLT@Tsinghua University & Xinjiang University)
☆17Updated 5 years ago
Alternatives and similar repositories for thuyg20_scripts
Users who are interested in thuyg20_scripts are comparing it to the repositories listed below.
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆73Updated last week
- Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)☆365Updated 4 months ago
- A toolset for computation and comparison of Chinese dialects☆41Updated last week
- Data and code for grapheme-to-phoneme transducers in lots of languages☆140Updated last year
- The repo provides information about KeSpeech dataset.☆160Updated 3 years ago
- Cantonese text filter (a filter for written Cantonese corpora)☆41Updated 7 months ago
- ☆93Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆172Updated 2 years ago
- Multilingual G2P in 100 languages☆361Updated 2 years ago
- Implementation of TTS with combination of Tacotron2 and HiFi-GAN☆11Updated 3 years ago
- PyTorch implementation of Tacotron-2.☆14Updated 4 years ago
- ASCEND Chinese-English code-switching dataset☆30Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆175Updated last month
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆95Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆356Updated 3 years ago
- Universal multilingual automatic speech transcription into IPA☆69Updated 8 months ago
- Pre-trained Wav2vec2.0 for Mandarin☆41Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆95Updated 7 months ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆260Updated 6 years ago
- Two development sets for the SEAME corpus☆41Updated 5 years ago
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆15Updated last year
- Chinese version of the end-to-end speech synthesis (TTS) model VITS; VITS Mandarin☆15Updated 3 years ago
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆34Updated last year
- Some basic Praat scripts.☆221Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆179Updated this week
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- The dataset of Speech Recognition☆432Updated 10 months ago
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
- Keyword spotting and forced alignment in any language☆77Updated 2 months ago
- This converter converts between multiple Uyghur scripts: ULS (Uyghur Latin Script), UAS (Uyghur Arabic Script), CTS (Common Turkic Script), UCS (Uyg…☆52Updated 2 months ago