shenfei1010 / CyberCanView external linksLinks
CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Kong.
☆12Aug 24, 2021Updated 4 years ago
Alternatives and similar repositories for CyberCan
Users that are interested in CyberCan are comparing it to the libraries listed below
Sorting:
- Loengfan (粵語兩分) is the Cantonese version of the Liang Fen input method☆15Mar 3, 2022Updated 3 years ago
- 漢語常用字詞表☆16Jun 3, 2023Updated 2 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 2 months ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 9 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- Rime TUPA input schema | rime 切韻拼音輸入方案☆46Nov 22, 2024Updated last year
- A Package for Cantonese Tokenisation☆18Jun 17, 2021Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last week
- ☆22Apr 21, 2022Updated 3 years ago
- a simple html5 jyutping learning game☆22Nov 25, 2025Updated 2 months ago
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated 10 months ago
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- Input a Chinese character and get all of its variant forms☆21Apr 13, 2025Updated 10 months ago
- 中州韻粵語拼音輸入法分歧拼音系統補丁 | For users of alternative Cantonese romanisation schemes☆25Sep 29, 2025Updated 4 months ago
- 💒 Reproducible Extraction of Cross-lingual Topics using R☆20Jul 12, 2023Updated 2 years ago
- 蘇州吳語拼音輸入方案 · 苏州吴语拼音输入方案 · A Rime input schema for Suzhou Dialect☆21Jan 7, 2026Updated last month
- Cantonese Input Method for macOS☆30Jan 25, 2025Updated last year
- 電腦用漢字粵語拼音表 / Cantonese Pronunciation List of the Characters for Computers☆62Jan 11, 2024Updated 2 years ago
- 粵語對話語料☆29May 12, 2023Updated 2 years ago
- Google Input Tools for macOS☆32Feb 3, 2022Updated 4 years ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- rime-cantonese 上游詞表倉庫☆31Dec 24, 2025Updated last month
- Spoken Cantonese from Hong Kong.☆30Nov 12, 2025Updated 3 months ago
- Cantoboard - Smart Cantonese Keyboard on iOS☆69Oct 15, 2023Updated 2 years ago
- 中古漢語(切韻音系)全拼及三拼☆32Mar 26, 2021Updated 4 years ago
- ☆32Jul 6, 2015Updated 10 years ago
- repository for 2018 Fall Stats 131 class at UCLA☆14Mar 1, 2019Updated 6 years ago
- Calculating Expected Time for training LLM.☆38Apr 17, 2023Updated 2 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- ☆10Jan 20, 2023Updated 3 years ago
- ☆13Jul 17, 2021Updated 4 years ago
- This set of apps allows you to type Latin characters in OSX and have them transliterate in real time into Cyrillic characters. This inclu…☆41Jan 22, 2014Updated 12 years ago
- 🦀Fastest ever Trad/Simp and regional Chinese variants converter | 中文简繁及地區詞轉換☆50Feb 5, 2026Updated last week
- GPT-powered Telegram bot built to help community members learn software development☆14Nov 9, 2025Updated 3 months ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- ☆11Sep 25, 2022Updated 3 years ago