cantonese-mandarin unsupervised neural translation for sw project
☆28May 2, 2023Updated 2 years ago
Alternatives and similar repositories for cantonese-nlp
Users that are interested in cantonese-nlp are comparing it to the libraries listed below
Sorting:
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆81Feb 17, 2026Updated 2 weeks ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- ☆99Feb 1, 2024Updated 2 years ago
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated 11 months ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- rime-cantonese 上游詞表倉庫☆32Dec 24, 2025Updated 2 months ago
- Scrape cantonese syllables from CUHK Multi-function Chinese Character Database.☆10Mar 18, 2015Updated 10 years ago
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆35Feb 26, 2026Updated last week
- Transformers for Cantonese☆57Oct 24, 2020Updated 5 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆23Nov 14, 2024Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Oct 17, 2021Updated 4 years ago
- an method to make vlm think like r1☆21May 28, 2025Updated 9 months ago
- Python 汉字到粤拼转换工具。☆35Feb 26, 2024Updated 2 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆86Nov 3, 2025Updated 4 months ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆21Jun 30, 2023Updated 2 years ago
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation…☆26Sep 10, 2025Updated 6 months ago
- A Python script for scraping LIHKG☆32Mar 7, 2022Updated 4 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- 粵語對話語料☆29May 12, 2023Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- ☆10Jan 20, 2023Updated 3 years ago
- ☆33Jun 29, 2023Updated 2 years ago
- [ICASSP‘25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics☆36Aug 10, 2025Updated 7 months ago
- ☆41May 15, 2023Updated 2 years ago
- Machine Translation (MT) Preparation Scripts☆36May 25, 2025Updated 9 months ago
- ☆13Jul 17, 2021Updated 4 years ago
- A working example of a NestJS project using the Prisma ORM with MySQL☆11Feb 26, 2024Updated 2 years ago
- 此仓库用于储存湖南理工学院oj上的题解☆11Oct 7, 2021Updated 4 years ago
- 粵音資料集叢:典籍資料☆233Mar 3, 2026Updated last week
- QA - Answer Selection (Rank candidate answers for a given question)☆36Apr 12, 2018Updated 7 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆43Dec 6, 2022Updated 3 years ago