shenfei1010 / CyberCanLinks
CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Kong.
☆12Updated 3 years ago
Alternatives and similar repositories for CyberCan
Users that are interested in CyberCan are comparing it to the libraries listed below
Sorting:
- A Package for Cantonese Tokenisation☆18Updated 3 years ago
- Twitter dataset for 2022 Russian and Ukrainian crisis☆48Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- Chinese Moral Foundation Dictionary☆18Updated last year
- Pre-trained ELECTRA from Hong Kong data☆29Updated 4 years ago
- Raw text of 申報☆26Updated 3 years ago
- Chinese Dialect Database☆17Updated 7 years ago
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆54Updated 2 years ago
- R Scraper for LIHKG, the Hong Kong version of Reddit.☆16Updated 4 years ago
- ☆47Updated 3 years ago
- A frequency lexicon for Hong Kong Cantonese☆22Updated 4 years ago
- Driver for LIWC2015 analysis. LIWC2015 dictionary not included.☆16Updated 2 years ago
- ☆22Updated last year
- Answers to some "weird" statistics questions with R code☆10Updated 3 months ago
- Digital Outrage Classifier from the Crockett Lab at Yale. Predicts whether tweets contain moral outrage.☆30Updated 2 years ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Updated last year
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- Github site with code and data associated with the ASR paper on the Geometry of Culture☆51Updated 5 years ago
- ☆41Updated 5 years ago
- ☆54Updated 2 years ago
- ☆21Updated 4 years ago
- Code for measuring novelty in science using publication text☆27Updated 3 months ago
- The Extended Moral Foundations Dictionary (E-MFD)☆40Updated 4 years ago
- Loengfan (粵語兩分) is the Cantonese version of the Liang Fen input method☆12Updated 3 years ago
- A Python implementation for Structural Topic Modeling☆42Updated 2 years ago
- https://sites.google.com/site/multidimensionaltagger☆34Updated last year
- Python code for producing emotionality scores from Gennaro and Ash (2021).☆18Updated 3 years ago
- This package consists of functionalities for dynamic topic modelling and its visualization☆25Updated 5 years ago
- 《跨越网络的门槛》☆22Updated 2 years ago
- This repository contains data of TikTok videos related to the 2024 U.S. Elections☆22Updated 3 months ago