justinchuntingho / songotstiView external linksLinks
A Package for Cantonese Tokenisation
☆18Jun 17, 2021Updated 4 years ago
Alternatives and similar repositories for songotsti
Users that are interested in songotsti are comparing it to the libraries listed below
Sorting:
- R Scraper for LIHKG, the Hong Kong version of Reddit.☆18Nov 24, 2020Updated 5 years ago
- 👚 Speedy Word Embedding Association Test & Extras using R☆31Aug 18, 2025Updated 6 months ago
- ☆12Jan 25, 2026Updated 3 weeks ago
- 💒 Reproducible Extraction of Cross-lingual Topics using R☆20Jul 12, 2023Updated 2 years ago
- Answers to some "weird" statistics questions with R code☆10Jun 8, 2025Updated 8 months ago
- R package to interact with the Pushift.io API☆10Aug 4, 2025Updated 6 months ago
- Image clustering☆13Jan 22, 2022Updated 4 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- Rank Degree Influencer Core Sampler (RaDICeS). A Twitter follow network crawler for influential accounts using the cost-free Twitter API.☆13Mar 14, 2023Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- Scrape cantonese syllables from CUHK Multi-function Chinese Character Database.☆10Mar 18, 2015Updated 10 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 2 months ago
- Easily access the party colors of thousands of parties around the world☆23Nov 27, 2021Updated 4 years ago
- A project written in R to get old tweets, it bypass some limitations of Twitter Official API.☆18Jul 22, 2020Updated 5 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- 常用香港外字表☆57Sep 7, 2022Updated 3 years ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API☆42Nov 21, 2025Updated 2 months ago
- An Amharic News Text classification Dataset☆38May 17, 2024Updated last year
- R library to help dealing with syntactic structure☆37Feb 3, 2025Updated last year
- ☆11Oct 2, 2022Updated 3 years ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024☆13Jun 24, 2024Updated last year
- Python 汉字到粤拼转换工具。☆35Feb 26, 2024Updated last year
- ☆12Jun 25, 2020Updated 5 years ago
- Research compendium for reproducible research☆12Sep 7, 2020Updated 5 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- 微信扫码登陆在Django上的实现☆11Jan 14, 2021Updated 5 years ago
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- ☆11Aug 1, 2024Updated last year
- Data & Analysis compendium for the Economic Sentiment analysis paper☆11Nov 18, 2025Updated 3 months ago
- ☆13Sep 25, 2024Updated last year
- RIME schema and dict files for Hakka dialect in Taiwan☆11Feb 18, 2020Updated 5 years ago
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- Shiny App for ChatGPT☆12May 21, 2023Updated 2 years ago
- ☆10Apr 17, 2024Updated last year
- ☆10Sep 19, 2022Updated 3 years ago
- Microsoft Complex Tasks Dataset☆17Jun 12, 2023Updated 2 years ago