Dictionary for Cantonese word segmentation
☆38 · Jun 4, 2024 · Updated last year
Alternatives and similar repositories for Cantonese_Word_Segmentation
Users that are interested in Cantonese_Word_Segmentation are comparing it to the libraries listed below
- fastText vectors created from Hong Kong data. ☆22 · Jul 7, 2020 · Updated 5 years ago
- Cantonese segmentation tool 粵語分詞工具 ☆30 · Aug 22, 2020 · Updated 5 years ago
- Pre-trained ELECTRA from Hong Kong data ☆29 · Jul 7, 2020 · Updated 5 years ago
- Cantonese TTS frontend ☆16 · Oct 14, 2019 · Updated 6 years ago
- Cantonese Linguistics and NLP ☆397 · May 23, 2024 · Updated last year
- 粵文語料篩選器 Cantonese text filter ☆41 · Feb 4, 2026 · Updated last month
- Tools for processing open Cantonese dictionary data provided by words.hk ☆23 · Jan 30, 2025 · Updated last year
- Cantonese word segmentation tool (粤语分词工具) ☆48 · Jul 29, 2018 · Updated 7 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue). ☆86 · Nov 3, 2025 · Updated 4 months ago
- A framework for Medical Image Segmentation and Filtering ☆10 · Mar 23, 2017 · Updated 8 years ago
- Landing page for Woolly Test ☆12 · Apr 15, 2025 · Updated 10 months ago
- ☆12 · Nov 8, 2019 · Updated 6 years ago
- Group Based Modeling Trajectory ☆13 · Jun 8, 2025 · Updated 9 months ago
- ☆14 · Updated this week
- Including several social-media-computing tools. ☆11 · Jan 4, 2019 · Updated 7 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English ☆40 · Apr 28, 2022 · Updated 3 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP ☆92 · Oct 17, 2021 · Updated 4 years ago
- Config file parsing/writing [Haskell] ☆17 · Mar 2, 2025 · Updated last year
- StumpWM Debugger ☆11 · Apr 19, 2025 · Updated 10 months ago
- Code and data to support Bamman et al. (2020), "A Dataset of Literary Coreference" (LREC) ☆10 · Dec 8, 2022 · Updated 3 years ago
- This repository provides a script to do OCR using some basic deep learning approaches ☆10 · Aug 27, 2020 · Updated 5 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- ☆15 · Nov 20, 2025 · Updated 3 months ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT. ☆11 · Apr 9, 2020 · Updated 5 years ago
- Wenet speech to text for React Native ☆10 · Nov 1, 2022 · Updated 3 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 · Sep 30, 2024 · Updated last year
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020). ☆11 · May 1, 2025 · Updated 10 months ago
- Classified tweets sentiment towards COVID-19 vaccine to detect people’s opinion towards vaccine and to identify overall customer ratings … ☆12 · Feb 24, 2021 · Updated 5 years ago
- ☆11 · Nov 27, 2018 · Updated 7 years ago
- Export sticker sets from Telegram ☆10 · Apr 3, 2025 · Updated 11 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning ☆23 · Jan 11, 2026 · Updated last month
- Semantic tokenizer for speech and music ☆21 · Jul 6, 2025 · Updated 8 months ago
- Test HanLP vs LTP ☆11 · Mar 28, 2018 · Updated 7 years ago
- Emacs package to remind you of the current clocking task or its absence ☆18 · Feb 22, 2023 · Updated 3 years ago
- The project aims to extract entities and relations from articles ☆11 · Feb 12, 2020 · Updated 6 years ago
- A Cantonese-English translator based on prompt engineering ☆12 · Sep 19, 2023 · Updated 2 years ago
- A tool for calculating WER (Word Error Rate) in Python. ☆14 · Sep 18, 2024 · Updated last year
- Easily document your Sanic API with a UI (and attrs!) ☆13 · Oct 4, 2018 · Updated 7 years ago
- Cross platform screen capture library ☆12 · Feb 28, 2025 · Updated last year