Shelton1013/Whisper_MCE

[ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics

☆40

Alternatives and similar repositories for Whisper_MCE

Users that are interested in Whisper_MCE are comparing it to the libraries listed below.

cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year
HLTCHKUST / cantonese-asr
☆103 · Feb 1, 2024 · Updated 2 years ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
avishaiElmakies / unsupervised_speech_segmentation_using_slm
☆20 · Jan 8, 2025 · Updated last year
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…
☆33 · Jun 14, 2024 · Updated 2 years ago
ictnlp / LSG
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15 · Jan 3, 2025 · Updated last year
isaacOnline / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆13 · Oct 28, 2023 · Updated 2 years ago
fcbond / hkcancor
Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
☆95 · Nov 3, 2025 · Updated 8 months ago
michaelneri / unsupervised-audio-anomaly-detection
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11 · Nov 6, 2024 · Updated last year
haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago
HLTCHKUST / ASCEND
ASCEND Chinese-English code-switching dataset
☆33 · Jul 12, 2022 · Updated 4 years ago
yuhangear / wenet-android
☆13 · Oct 27, 2021 · Updated 4 years ago
Koziev / StressModel
Neural model for prediction of stress position in Russian words
☆13 · Jun 22, 2025 · Updated last year
TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago
thu-spmi / SPMILM
A SPMI Lab toolkit for language models.
☆11 · Apr 12, 2017 · Updated 9 years ago
janson9192 / autokws2021
☆13 · Mar 25, 2021 · Updated 5 years ago
chenchenzi / HKCantonese_models
This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.
☆29 · Nov 14, 2024 · Updated last year
pengzhendong / g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115 · Updated this week
cyhuang-tw / robust-vc
☆11 · May 7, 2022 · Updated 4 years ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆34 · Apr 22, 2026 · Updated 3 months ago
khanld / Dynamic-Mixing
Dynamic Mixing For Speech Processing (mix-on-the-fly)
☆22 · Jul 19, 2022 · Updated 4 years ago
RRisto / learning
materials for learning various stuff by doing them myself
☆24 · Dec 25, 2025 · Updated 7 months ago
CaA23187 / GCCRN_full
A pytorch implementation of GCCRN
☆14 · Dec 18, 2021 · Updated 4 years ago
arnabdas8901 / StarGAN-VC_PlusPlus
☆11 · Aug 11, 2023 · Updated 2 years ago
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
AdolfVonKleist / RnnLMG2P
Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs
☆30 · Dec 15, 2014 · Updated 11 years ago
X-LANCE / StoryTTS
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141 · Apr 27, 2024 · Updated 2 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
liuhuang31 / Megatts2_HierSpeechpp
Megatts2 uses HierSpeechpp's vocoder
☆18 · Dec 2, 2024 · Updated last year
The-Data-Dilemma / MediBeng-Whisper-Tiny
MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech…
☆29 · Jul 24, 2025 · Updated last year
BUTSpeechFIT / DeCRED
☆18 · Aug 13, 2025 · Updated 11 months ago
AsoSoft / AsoSoft-TTS-Speech-Corpus-for-Central-Kurdish
AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech
☆23 · Jun 24, 2022 · Updated 4 years ago
mozilla / murmur
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20 · May 23, 2019 · Updated 7 years ago
nethermanpro / ComSL
☆11 · Oct 14, 2023 · Updated 2 years ago
R1ckShi / FrontEnd-AEC
Acoustic echo cancellation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.
☆19 · Apr 22, 2019 · Updated 7 years ago
emagar / elecRetrns
Recent Mexican Election Vote Returns
☆12 · Updated this week
mt-upc / ZeroSwot
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25 · Dec 12, 2024 · Updated last year
burrmill / burrmill
BurrMill core
☆22 · Nov 2, 2021 · Updated 4 years ago