HLTCHKUST / cantonese-asrView external linksLinks
☆99Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for cantonese-asr
Users that are interested in cantonese-asr are comparing it to the libraries listed below
Sorting:
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- [ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics☆36Aug 10, 2025Updated 6 months ago
- ☆10Apr 17, 2024Updated last year
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆85Nov 3, 2025Updated 3 months ago
- cantonese-mandarin unsupervised neural translation for sw project☆28May 2, 2023Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
- Transformers for Cantonese☆57Oct 24, 2020Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last week
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆41Jul 16, 2024Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Nov 14, 2024Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Oct 17, 2021Updated 4 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Spoken Cantonese from Hong Kong.☆30Nov 12, 2025Updated 3 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 2 months ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆81Sep 24, 2024Updated last year
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Nov 4, 2022Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- ☆14Aug 19, 2024Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆82Oct 19, 2023Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 2 months ago
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago