toastynews/electra-hongkongese

Pre-trained ELECTRA from Hong Kong data

☆29

Alternatives and similar repositories for electra-hongkongese

Users that are interested in electra-hongkongese are comparing it to the libraries listed below.

paramiai / cantoformer
Transformers for Cantonese
☆58 · Oct 24, 2020 · Updated 5 years ago
toastynews / hong-kong-fastText
fastText vectors created from Hong Kong data.
☆22 · Jul 7, 2020 · Updated 6 years ago
ayaka14732 / cantoseg
Cantonese segmentation tool 粵語分詞工具
☆31 · Aug 22, 2020 · Updated 5 years ago
UniversalDependencies / UD_Cantonese-HK
Spoken Cantonese from Hong Kong.
☆30 · May 6, 2026 · Updated 2 months ago
dbamman / akkadian-morph-analyzer
Morphological Analyzer for Akkadian
☆15 · Aug 7, 2013 · Updated 12 years ago
UniversalConceptualCognitiveAnnotation / UCCA_English-Wiki
Corpus for Universal Conceptual Cognitive Annotation
☆13 · Mar 5, 2021 · Updated 5 years ago
wchan757 / Cantonese_Word_Segmentation
Dictionary for Cantonese word segmentation
☆39 · Jun 4, 2024 · Updated 2 years ago
CanCLID / canto-filter
粵文語料篩選器 Cantonese text filter
☆43 · Feb 4, 2026 · Updated 5 months ago
jacksonllee / pycantonese
Cantonese Linguistics and NLP
☆413 · May 26, 2026 · Updated last month
gwinterstein / Cifu
A frequency lexicon for Hong Kong Cantonese
☆25 · Aug 27, 2020 · Updated 5 years ago
CanCLID / awesome-cantonese-nlp
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
☆95 · Oct 17, 2021 · Updated 4 years ago
annefried / sitent
Situation entity type labeling system
☆15 · Mar 6, 2024 · Updated 2 years ago
ayaka14732 / bert-tokenizer-cantonese
BERT Tokenizer with vocabulary tailored for Cantonese
☆23 · Oct 27, 2022 · Updated 3 years ago
g-traveller / cantonese-corpus
Cantonese word segmentation tool
☆48 · Jul 29, 2018 · Updated 7 years ago
XiangLi1999 / PosteriorControl-NLG
Posterior Control of Blackbox Generation
☆23 · May 2, 2020 · Updated 6 years ago
konstantinosKokos / Transformers
A few transformer models
☆22 · Jun 30, 2020 · Updated 6 years ago
mahoffman / social_network_analysis
☆15 · Oct 9, 2021 · Updated 4 years ago
lucy3 / ingroup_lang
Code for 2021 TACL paper on community-specific language
☆13 · Dec 8, 2022 · Updated 3 years ago
justinchuntingho / songotsti
A Package for Cantonese Tokenisation
☆18 · Jun 17, 2021 · Updated 5 years ago
greywizard / ml-waf-tutorial
See more at:
☆16 · Jun 3, 2019 · Updated 7 years ago
shenfei1010 / CyberCan
CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…
☆12 · Aug 24, 2021 · Updated 4 years ago
cpllab / syntactic-generalization
Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"
☆31 · Jun 18, 2021 · Updated 5 years ago
indiejoseph / hkcc-corpus
Packager for the Corpus of Mid-20th Century Hong Kong Cantonese (香港二十世紀中期粵語語料庫)
☆16 · Apr 12, 2016 · Updated 10 years ago
voidful / nlp2go
🏃 hosting nlp models in one line
☆20 · May 8, 2024 · Updated 2 years ago
kowndinya-renduchintala / POSIX
POSIX: A Prompt Sensitivity Index for Language Models
☆13 · Nov 13, 2024 · Updated last year
wordshk / yue_references
粵語/廣東話參考資料 Reference Materials for Yue / Cantonese
☆15 · Dec 12, 2025 · Updated 7 months ago
voidful / wav2vec2-xlsr-multilingual-56
56 language, 1 model Multilingual ASR
☆24 · Jul 25, 2021 · Updated 4 years ago
hon9kon9ize / hkeval2025
☆21 · Aug 12, 2025 · Updated 11 months ago
alsonicr / quarto-apa7
An apa7 template for quarto/posit
☆12 · Jan 25, 2023 · Updated 3 years ago
UCREL / pymusas
Python Multilingual Ucrel Semantic Analysis System
☆41 · May 29, 2026 · Updated last month
Algram / PodcastAutomator
🎧 Simple bash-script to automatically download the most recent podcasts from a list of rss-feeds and upload them to your Dropbox.
☆10 · Nov 30, 2015 · Updated 10 years ago
CNMan / XDHYDCD
Headword list (characters and words) of the 现代汉语大词典 (Great Dictionary of Modern Chinese)
☆29 · Dec 29, 2020 · Updated 5 years ago
fangjh13 / protect_python_code
☆10 · Aug 14, 2019 · Updated 6 years ago
hellonlp / sentence_segmentation_dl
☆23 · Oct 20, 2021 · Updated 4 years ago
bhavyaghai / WordBias
WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings
☆23 · Aug 18, 2025 · Updated 11 months ago
pacotvj99 / testsampleR
☆14 · Jan 25, 2026 · Updated 5 months ago
buptlj / learn_tf
TensorFlow: learn and practice
☆11 · Aug 30, 2018 · Updated 7 years ago
THU-KEG / Entity-Linking-Trends-and-History
Papers about the trend of Entity Linking in recent years.
☆11 · Sep 5, 2022 · Updated 3 years ago
sebastianruder / emnlp2021-multiqa-tutorial
EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering
☆38 · Nov 7, 2021 · Updated 4 years ago