gheyret / thuyg20_scripts
Script files for THUYG-20 (a free Uyghur speech database released by CSLT@Tsinghua University & Xinjiang University)
☆12Updated 4 years ago
Related projects
Alternatives and complementary repositories for thuyg20_scripts
- Chinese polyphone disambiguation for Text-to-Speech application☆28Updated 5 months ago
- The project for speech translation☆11Updated last year
- Reverses the normalization of normalized Chinese text; that is, it implements the inverse of chinese_text_normalization.☆12Updated 3 years ago
- Speech Recognition for Uyghur using deep learning☆30Updated 3 years ago
- ☆16Updated this week
- ☆13Updated last year
- Speech Recognition for Uyghur using Speech transformer☆20Updated 3 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆34Updated 3 years ago
- E2E ASR system☆14Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆25Updated 7 months ago
- A simple command line tool to calculate WER for ASR.☆13Updated last month
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆13Updated last week
- End-to-End Speech Processing Toolkit☆11Updated last month
- The implementation of g2pL with a new open dataset.☆16Updated last year
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆15Updated 11 months ago
- The case study and multilingual performance of ICASSP submission☆19Updated 2 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- ☆25Updated 2 weeks ago
- ☆15Updated 2 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated last year
- List of speech synthesis papers.☆11Updated this week
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25Updated last year
- Survey on speech generation work.☆12Updated 11 months ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆10Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆19Updated last year
- ☆13Updated 2 years ago
- ☆9Updated 3 years ago
- Uyghur Single Speaker Speech Dataset.☆22Updated 2 years ago