Shelton1013 / Whisper_MCELinks
[ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆34Updated 3 months ago
Alternatives and similar repositories for Whisper_MCE
Users that are interested in Whisper_MCE are comparing it to the libraries listed below
Sorting:
- CTC decoder with hotwords for ASR.☆34Updated 7 months ago
- ☆24Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated last year
- ☆17Updated 6 months ago
- How to use our public wav2vec2 age and gender model☆51Updated 2 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 4 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated 2 years ago
- ☆20Updated 3 months ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆42Updated 8 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆33Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Updated 4 months ago
- ☆15Updated last year
- Survey on speech generation work.☆20Updated 2 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆41Updated last year
- ☆11Updated 2 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 3 years ago
- The repoduction codes for Qwen-Audio Fine-tuning☆52Updated last year
- ☆95Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆38Updated 4 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆28Updated 3 weeks ago
- ASR text preprocessing utility☆21Updated last year
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- ☆88Updated 4 months ago
- ASCEND Chinese-English code-switching dataset☆30Updated 3 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆38Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆75Updated 4 months ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆108Updated 2 months ago
- Colab notebooks for Next-gen Kaldi☆30Updated last month