gheyret / thuyg20_scripts
Script files of THUYG-20(A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University)
☆15Updated 5 years ago
Alternatives and similar repositories for thuyg20_scripts:
Users that are interested in thuyg20_scripts are comparing it to the libraries listed below
- Chinese polyphone disambiguation for Text-to-Speech application☆32Updated 9 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- The project for speech translation☆11Updated last year
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Updated 3 years ago
- ☆14Updated last year
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆33Updated last year
- ☆18Updated 6 months ago
- open-source Mandarian biased word dataset☆11Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- Grapheme-to-Phoneme lexicons for Chinese dialects☆67Updated 2 years ago
- ☆15Updated 8 months ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- ☆16Updated 5 years ago
- ☆11Updated last year
- This is the experimental description of MnTTS2.☆9Updated 11 months ago
- ☆9Updated 5 years ago
- End-to-End Speech Processing Toolkit☆13Updated 2 months ago
- One command to start a streaming ASR server.☆11Updated 6 months ago
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Taiwanese Speech Synthesis with Tacotron2☆19Updated 2 years ago
- ☆10Updated 4 months ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 3 months ago
- Mutiband version of HIFIGAN☆18Updated 4 years ago
- ☆15Updated 2 years ago
- E2E ASR system☆14Updated 2 years ago
- Speech samples and code of BEdit-TTS☆32Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago