lmorgadodacosta / CantoneseWN
The Cantonese Wordnet
☆14Updated last year
Alternatives and similar repositories for CantoneseWN:
Users who are interested in CantoneseWN compare it to the libraries listed below
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆35Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆36Updated last month
- Packager for the Corpus of Mid-20th Century Hong Kong Cantonese (《香港二十世紀中期粵語語料庫》)☆16Updated 8 years ago
- Spoken Cantonese from Hong Kong.☆29Updated 2 months ago
- Cantonese segmentation tool 粵語分詞工具☆29Updated 4 years ago
- Upstream word list repository for rime-cantonese☆27Updated 4 months ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆20Updated 2 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Global ASP - African Storybook Project for the World☆14Updated 2 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- Cantonese romanization conversion tables☆31Updated 8 months ago
- An English-to-Cantonese machine translation model☆49Updated 9 months ago
- Chinese Wordnet v.2☆22Updated 8 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆35Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆14Updated 2 years ago
- American English Pronunciation Dictionary☆34Updated 6 years ago
- Compute useful transcription metrics (CER, WER, SER, ...)☆26Updated 10 years ago
- Transformers for Cantonese☆56Updated 4 years ago
- Aligned bilingual word vectors for English and Chinese☆11Updated 6 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆17Updated 5 years ago
- Explores Chinese language models with sub-character-level visual information☆16Updated 6 years ago
- The zhong [|] Chinese grammars☆13Updated 3 years ago
- Taiwanese Hokkien Transliterator and Tokeniser☆29Updated 4 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Labeled data for homograph disambiguation☆54Updated last year
- Cantonese-Mandarin unsupervised neural translation for sw project☆25Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- A database of number names for 186 languages, locales, and scripts☆65Updated last year