Shelton1013 / Whisper_MCE
[ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆36Aug 10, 2025Updated 6 months ago
Alternatives and similar repositories for Whisper_MCE
Users that are interested in Whisper_MCE are comparing it to the libraries listed below
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- ☆99Feb 1, 2024Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- This repository contains all the code necessary for running the multilingual DistilWhisper from the Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 3 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- Neural model for prediction of stress position in Russian words☆12Jun 22, 2025Updated 7 months ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆21Jun 9, 2025Updated 8 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- ☆13Mar 25, 2021Updated 4 years ago
- ☆14Aug 19, 2024Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Nov 14, 2024Updated last year
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Chinese speech recognition, automatic speech recognition (ASR)☆14Dec 30, 2021Updated 4 years ago
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year
- ☆16Dec 23, 2021Updated 4 years ago
- Megatts2 using HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Jun 24, 2022Updated 3 years ago
- ☆19Jan 8, 2025Updated last year
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.☆16Jun 17, 2022Updated 3 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆33Sep 9, 2025Updated 5 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- ☆32Dec 24, 2025Updated last month
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- Tools for processing open Cantonese dictionary data provided by words.hk☆23Jan 30, 2025Updated last year
- Materials for learning various stuff by doing them myself☆24Dec 25, 2025Updated last month
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago