Shelton1013 / Whisper_MCELinks
[ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆34Updated 4 months ago
Alternatives and similar repositories for Whisper_MCE
Users that are interested in Whisper_MCE are comparing it to the libraries listed below
Sorting:
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- ☆11Updated 2 years ago
- ☆15Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆41Updated last year
- Visual Speech Recongnition☆19Updated 11 months ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated 2 years ago
- ☆88Updated 4 months ago
- ☆22Updated last year
- CTC decoder with hotwords for ASR.☆34Updated 8 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆75Updated 2 weeks ago
- Python wrapper for kaldi's arpa2fst☆38Updated 3 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆33Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- ☆13Updated last year
- ☆20Updated 4 months ago
- ☆37Updated 4 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆23Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Updated last month
- Colab notebooks for Next-gen Kaldi☆29Updated 2 months ago
- The project for speech translation☆12Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆78Updated 5 months ago
- Official release of StyleTalk dataset.☆70Updated last year
- How to use our public wav2vec2 age and gender model☆52Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 7 months ago
- Extract phoneme-level timestamps from speeh audio.☆103Updated last month
- ☆25Updated 3 years ago