Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
☆89Nov 3, 2025Updated 7 months ago
Alternatives and similar repositories for hkcancor
Users that are interested in hkcancor are comparing it to the libraries listed below.
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆41Dec 30, 2020Updated 5 years ago
- This is a repository dedicated to pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆27Nov 14, 2024Updated last year
- ☆102Feb 1, 2024Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆94Oct 17, 2021Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆43Feb 4, 2026Updated 4 months ago
- A frequency lexicon for Hong Kong Cantonese☆25Aug 27, 2020Updated 5 years ago
- Cantonese Linguistics and NLP☆408May 26, 2026Updated 3 weeks ago
- Transformers for Cantonese☆58Oct 24, 2020Updated 5 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- Cantonese Video Transcribe Service☆42Jul 25, 2025Updated 10 months ago
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- Latex template for CUHK PhD Thesis☆14Jun 29, 2025Updated 11 months ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Cantonese dialogue corpus (粵語對話語料)☆31May 12, 2023Updated 3 years ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated 2 years ago
- ☆12Mar 31, 2020Updated 6 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 7 months ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- Paper list of dementia detection☆44Mar 24, 2026Updated 2 months ago
- An open Cantonese dictionary for iOS and Android built with Flutter☆60Jan 30, 2025Updated last year
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- This repository stores solutions to problems on the Hunan Institute of Science and Technology OJ☆11Oct 7, 2021Updated 4 years ago
- [ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics☆38Aug 10, 2025Updated 10 months ago
- Packager for the Corpus of Mid-20th Century Hong Kong Cantonese (《香港二十世紀中期粵語語料庫》)☆16Apr 12, 2016Updated 10 years ago
- Dictionary for Cantonese word segmentation☆39Jun 4, 2024Updated 2 years ago
- ☆10Jun 1, 2024Updated 2 years ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆90Feb 17, 2026Updated 3 months ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Nov 27, 2019Updated 6 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- ☆18May 4, 2025Updated last year
- ASCEND Chinese-English code-switching dataset☆32Jul 12, 2022Updated 3 years ago
- Ideographic Description Sequences☆32Nov 27, 2025Updated 6 months ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Example code - use word embeddings to make emoji prediction smarter with context☆11Sep 14, 2018Updated 7 years ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆13May 2, 2021Updated 5 years ago
- Detect emotion from audio☆14Nov 20, 2018Updated 7 years ago
- ☆11May 7, 2022Updated 4 years ago