Pre-trained ELECTRA from Hong Kong data
☆29 · Jul 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for electra-hongkongese
Users who are interested in electra-hongkongese are comparing it to the libraries listed below.
- Scraped reviews from OpenRice for sentiment analysis, formatted for use with BERT. ☆11 · Apr 9, 2020 · Updated 5 years ago
- Cantonese segmentation tool 粵語分詞工具 ☆30 · Aug 22, 2020 · Updated 5 years ago
- Transformers for Cantonese ☆57 · Oct 24, 2020 · Updated 5 years ago
- fastText vectors created from Hong Kong data. ☆22 · Jul 7, 2020 · Updated 5 years ago
- Spoken Cantonese from Hong Kong. ☆30 · Nov 12, 2025 · Updated 3 months ago
- A frequency lexicon for Hong Kong Cantonese ☆23 · Aug 27, 2020 · Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter ☆41 · Feb 4, 2026 · Updated last month
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko… ☆12 · Aug 24, 2021 · Updated 4 years ago
- Zero-Shot Translation implemented by Transformer ☆14 · Mar 24, 2023 · Updated 2 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese ☆14 · Dec 12, 2025 · Updated 2 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP ☆92 · Oct 17, 2021 · Updated 4 years ago
- Dictionary for Cantonese word segmentation ☆38 · Jun 4, 2024 · Updated last year
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue). ☆86 · Nov 3, 2025 · Updated 4 months ago
- BERT Tokenizer with vocabulary tailored for Cantonese ☆23 · Oct 27, 2022 · Updated 3 years ago
- Packager for 《香港二十世紀中期粵語語料庫》 (a corpus of mid-20th-century Hong Kong Cantonese) ☆16 · Apr 12, 2016 · Updated 9 years ago
- A Package for Cantonese Tokenisation ☆18 · Jun 17, 2021 · Updated 4 years ago
- Cantonese word segmentation tool (粤语分词工具) ☆48 · Jul 29, 2018 · Updated 7 years ago
- 56 languages, 1 model: multilingual ASR ☆24 · Jul 25, 2021 · Updated 4 years ago
- List of commonly used Hong Kong characters outside standard character sets (常用香港外字表) ☆57 · Sep 7, 2022 · Updated 3 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆40 · Dec 30, 2020 · Updated 5 years ago
- PotatoGUI is a mod menu for Phasmophobia. ☆12 · Nov 4, 2020 · Updated 5 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Nov 7, 2021 · Updated 4 years ago
- Super Flappy Bird in p5.js ☆10 · Mar 8, 2021 · Updated 5 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor… ☆47 · May 29, 2023 · Updated 2 years ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816 ☆46 · May 20, 2021 · Updated 4 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English ☆40 · Apr 28, 2022 · Updated 3 years ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024 ☆14 · Jun 24, 2024 · Updated last year
- Answers to some "weird" statistics questions with R code ☆10 · Jun 8, 2025 · Updated 9 months ago
- An apa7 template for quarto/posit ☆12 · Jan 25, 2023 · Updated 3 years ago
- Elm Set built on top of AnyDict ☆10 · Aug 12, 2024 · Updated last year
- Object annotation maker in VOC Pascal format using object images and background images ☆10 · Feb 27, 2021 · Updated 5 years ago
- Data & Analysis compendium for the Economic Sentiment analysis paper ☆11 · Nov 18, 2025 · Updated 3 months ago