toastynews / electra-hongkongeseView external linksLinks
Pre-trained ELECTRA from Hong Kong data
☆29Jul 7, 2020Updated 5 years ago
Alternatives and similar repositories for electra-hongkongese
Users that are interested in electra-hongkongese are comparing it to the libraries listed below
Sorting:
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- Transformers for Cantonese☆57Oct 24, 2020Updated 5 years ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last week
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- ☆99Feb 1, 2024Updated 2 years ago
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 2 months ago
- Zero-Shot Translation implemented by Transformer☆14Mar 24, 2023Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Oct 17, 2021Updated 4 years ago
- Dictionary for Cantonese word segmentation☆38Jun 4, 2024Updated last year
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 9 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- A Package for Cantonese Tokenisation☆18Jun 17, 2021Updated 4 years ago
- 粤语分词工具☆48Jul 29, 2018Updated 7 years ago
- 🏃 hosting nlp models in one line☆20May 8, 2024Updated last year
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- 常用香港外字表☆57Sep 7, 2022Updated 3 years ago
- ☆23Oct 20, 2021Updated 4 years ago
- 《现代汉语大词典》字词头☆27Dec 29, 2020Updated 5 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Jul 30, 2021Updated 4 years ago
- Comprehensive quantitative comparison of lossless and lossy audio codecs☆39Feb 11, 2023Updated 3 years ago
- ☆10Jan 20, 2023Updated 3 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- Group Based Modeling Trajectory☆13Jun 8, 2025Updated 8 months ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024☆13Jun 24, 2024Updated last year
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago
- ☆11Sep 25, 2022Updated 3 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Apr 28, 2022Updated 3 years ago
- Answers to some "weird" statistics questions with R code☆10Jun 8, 2025Updated 8 months ago
- ☆13Feb 18, 2023Updated 2 years ago
- ☆11Feb 16, 2025Updated last year
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- Data & Analysis compendium for the Economic Sentiment analysis paper☆11Nov 18, 2025Updated 3 months ago
- ☆11Aug 1, 2024Updated last year
- 开放中文转换 - 简繁转换之通用规范汉字标准☆13Jan 27, 2026Updated 3 weeks ago
- UW DigiPsych Prosody Feature Extraction Repository☆13May 16, 2019Updated 6 years ago
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- ☆10Nov 25, 2022Updated 3 years ago