Shelton1013 / Whisper_MCELinks
[ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆33Updated 2 months ago
Alternatives and similar repositories for Whisper_MCE
Users that are interested in Whisper_MCE are comparing it to the libraries listed below
Sorting:
- CTC decoder with hotwords for ASR.☆31Updated 6 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆25Updated 11 months ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆33Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆62Updated 4 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- ☆15Updated last year
- ASCEND Chinese-English code-switching dataset☆30Updated 3 years ago
- Speech samples and code of BEdit-TTS☆34Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆29Updated 3 weeks ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- ☆11Updated 2 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆77Updated 4 months ago
- ☆25Updated 3 years ago
- The project for speech translation☆12Updated 2 years ago
- ☆17Updated 5 months ago
- ASR text preprocessing utility☆21Updated last year
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆42Updated 7 months ago
- This is the experimental description of MnTTS2.☆11Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Updated last year
- ☆87Updated 3 months ago
- ☆23Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated last year
- kaldi cnn-tdnnf baseline☆13Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆22Updated 3 weeks ago
- Python wrapper for kaldi's arpa2fst☆38Updated 2 months ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆96Updated last month
- ☆20Updated 2 months ago