CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Kong.
☆12Aug 24, 2021Updated 4 years ago
Alternatives and similar repositories for CyberCan
Users that are interested in CyberCan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Loengfan (粵語兩分) is the Cantonese version of the Liang Fen input method☆15Mar 3, 2022Updated 4 years ago
- 漢語常用字詞表☆16Jun 3, 2023Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- 💒 Reproducible Extraction of Cross-lingual Topics using R☆20Jul 12, 2023Updated 2 years ago
- 中文文本主题提取,并根据主题,对预测文本进行分类☆12May 18, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆32Jul 6, 2015Updated 10 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- A Package for Cantonese Tokenisation☆18Jun 17, 2021Updated 4 years ago
- Outputs Canvas discussions as a CSV for specified course.☆14Feb 27, 2026Updated 3 weeks ago
- Rime TUPA input schema | rime 切韻拼音輸入方案☆47Feb 12, 2026Updated last month
- R package to interact with the Pushift.io API☆10Aug 4, 2025Updated 7 months ago
- 蘇州吳語拼音輸入方案 · 苏州吴语拼音输入方案 · A Rime input schema for Suzhou Dialect☆21Feb 22, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Input a Chinese character and get all of its variant forms☆21Apr 13, 2025Updated 11 months ago
- a simple html5 jyutping learning game☆23Nov 25, 2025Updated 4 months ago
- ☆22Apr 21, 2022Updated 3 years ago
- 電腦用漢字粵語拼音表 / Cantonese Pronunciation List of the Characters for Computers☆63Jan 11, 2024Updated 2 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 3 months ago
- Dataset for analysing Propagation of COVID-19 Misinformation on Twitter☆18Jan 31, 2024Updated 2 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 9 years ago
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last month
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- R Scraper for LIHKG, the Hong Kong version of Reddit.☆18Nov 24, 2020Updated 5 years ago
- 中州韻粵語拼音輸入法分歧拼音系統補丁 | For users of alternative Cantonese romanisation schemes☆26Sep 29, 2025Updated 5 months ago
- Slides and homework for model based inference☆13Sep 26, 2017Updated 8 years ago
- Simple online editor of math formulas based on LaTeX syntax. Contains table of popular equations and chars for easy work with it to help …☆10Sep 13, 2019Updated 6 years ago
- Introduction to Python Programming for Data Science☆40Oct 3, 2023Updated 2 years ago
- Google Input Tools for macOS☆33Mar 14, 2026Updated last week
- Online BaseHangul Encoder And Decoder☆12Jan 30, 2023Updated 3 years ago
- Corpus of Black Lives Matters and counter protests tweets☆14Dec 22, 2022Updated 3 years ago
- iOS 动画实战之钓鱼小游戏☆29Mar 26, 2018Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generates Graphviz image URL that can be used directly on any website without a need to host them on a server☆13Feb 21, 2026Updated last month
- ☆13Nov 20, 2023Updated 2 years ago
- ☆10Oct 21, 2022Updated 3 years ago
- Page for the class "Computational Social Science with Images and Audio" at ETH Zurich.☆13Sep 18, 2025Updated 6 months ago
- 中古漢語(切韻音系)全拼及三拼☆32Mar 26, 2021Updated 5 years ago
- R library for accessing data from everypolitician.org☆20Apr 24, 2018Updated 7 years ago
- Using snscrape and tweepy libraries to scrape unlimited amount of tweets☆26Mar 1, 2021Updated 5 years ago