justinchuntingho / songotsti
A Package for Cantonese Tokenisation
☆17 Updated 3 years ago
Alternatives and similar repositories for songotsti:
Users who are interested in songotsti are comparing it to the libraries listed below.
- R Scraper for LIHKG, the Hong Kong version of Reddit. ☆16 Updated 4 years ago
- Answers to some "weird" statistics questions with R code ☆10 Updated 3 weeks ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http… ☆11 Updated last year
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API ☆36 Updated 2 months ago
- 💒 Reproducible Extraction of Cross-lingual Topics using R ☆20 Updated last year
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko… ☆12 Updated 3 years ago
- 👚 Speedy Word Embedding Association Test & Extras using R ☆30 Updated last month
- flairR: Bring Amazing Flair NLP to R ☆30 Updated last month
- Code and models for 3 different tools to measure appeals to 8 discrete emotions in German political text ☆13 Updated 2 years ago
- Natural Language Processing for Political Science ☆20 Updated 7 years ago
- Raw text of 申報 ☆25 Updated 3 years ago
- ☆21 Updated last year
- An all-in-one R package for the assessment of linguistic similarity ☆11 Updated last month
- ☆12 Updated 7 months ago
- Extract effects from estimateEffect in the stm package ☆48 Updated 4 years ago
- Neural Language Models for Historical Research ☆25 Updated 5 months ago
- The official Github for the American Stories dataset as in {link} ☆116 Updated last year
- Repository for paper "Embedding Regression: Models for Context-Specific Description and Inference" ☆90 Updated 2 years ago
- Materials for "Text as Data" classes at Penn State and Essex. ☆24 Updated 3 years ago
- 🍵 Create and administrate validation tests for automated content analysis tools. ☆55 Updated last month
- U.S. County level word and topic loading derived from a 10% Twitter sample from 2009-2015. ☆21 Updated 3 years ago
- Multilingual stopword lists ☆17 Updated 8 months ago
- ☆37 Updated 11 months ago
- Chris Bail's graduate-level computational social science course ☆24 Updated 4 years ago
- A short demo of (r)Ollama ☆11 Updated 5 months ago
- Twitter dataset for 2022 Russian and Ukrainian crisis ☆49 Updated 2 years ago
- Pre-trained ELECTRA from Hong Kong data ☆28 Updated 4 years ago
- ☆12 Updated last year
- TA Lecture Slides for Quantitative Social Science: An introduction in tidyverse ☆17 Updated 2 years ago
- semgram: R package for extracting semantic motifs from text ☆23 Updated 2 years ago