fcbond / hkcancorView external linksLinks
Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
☆85Nov 3, 2025Updated 3 months ago
Alternatives and similar repositories for hkcancor
Users that are interested in hkcancor are comparing it to the libraries listed below
Sorting:
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last week
- ☆99Feb 1, 2024Updated 2 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Nov 14, 2024Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Oct 17, 2021Updated 4 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- Transformers for Cantonese☆57Oct 24, 2020Updated 5 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆28May 2, 2023Updated 2 years ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- Cantonese Linguistics and NLP☆396May 23, 2024Updated last year
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 9 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Cantonese Video Transcribe Service☆22Jul 25, 2025Updated 6 months ago
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 3 months ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆13Sep 25, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- ☆10Apr 17, 2024Updated last year
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated 10 months ago
- ☆11May 7, 2022Updated 3 years ago
- ☆11Jul 14, 2023Updated 2 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- 粵語對話語料☆29May 12, 2023Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- ☆53May 26, 2022Updated 3 years ago
- ☆12Mar 31, 2020Updated 5 years ago
- CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]☆22Jun 12, 2025Updated 8 months ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Ideographic Description Sequences☆32Nov 27, 2025Updated 2 months ago
- Spoken Cantonese from Hong Kong.☆30Nov 12, 2025Updated 3 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year