indiejoseph / hkcc-corpusLinks
《香港二十世紀中期粵語語料庫》打包器
☆16Updated 9 years ago
Alternatives and similar repositories for hkcc-corpus
Users that are interested in hkcc-corpus are comparing it to the libraries listed below
Sorting:
- fastText vectors created from Hong Kong data.☆21Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆40Updated 4 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Updated 3 years ago
- A frequency lexicon for Hong Kong Cantonese☆22Updated 4 years ago
- The Cantonese Wordnet☆14Updated last year
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Updated last year
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- 粵語拼音轉換表☆35Updated 3 months ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Updated 3 years ago
- Cantonese Linguistics and NLP☆387Updated last year
- Spoken Cantonese from Hong Kong.☆29Updated 2 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆92Updated last week
- 粵音資料集叢:典籍資料☆215Updated last week
- Input a Chinese character. Output all the variant characters of it.☆20Updated 3 months ago
- Cantonese Romanization Converter☆17Updated 4 years ago
- A toolset for computation and comparison of Chinese dialects☆37Updated last week
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- Transformers for Cantonese☆57Updated 4 years ago
- 《现代汉语大词典》字词头☆28Updated 4 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆22Updated 2 years ago
- Han character library for CJKV languages☆159Updated 4 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆18Updated 5 years ago
- ☆51Updated 3 years ago
- Nanning Dialect Booklet☆11Updated last week
- 漢字データベースの辞書関連データ☆99Updated 2 years ago
- YonhTenxMyangx 韻典網☆124Updated last year
- Pre-trained ELECTRA from Hong Kong data☆29Updated 5 years ago
- A tool for ancient Chinese segmentation.☆54Updated 6 years ago
- classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset☆35Updated 2 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆100Updated 8 months ago