Cantonese segmentation tool 粵語分詞工具
☆30 Aug 22, 2020 Updated 5 years ago
Alternatives and similar repositories for cantoseg
Users who are interested in cantoseg are comparing it to the libraries listed below
- A Python script for scraping LIHKG ☆32 Mar 7, 2022 Updated 3 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko… ☆12 Aug 24, 2021 Updated 4 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese ☆14 Dec 12, 2025 Updated 2 months ago
- Packager for the Corpus of Mid-20th Century Hong Kong Cantonese (《香港二十世紀中期粵語語料庫》) ☆16 Apr 12, 2016 Updated 9 years ago
- 粵文語料篩選器 Cantonese text filter ☆41 Feb 4, 2026 Updated last month
- A Cantonese-English translator based on prompt engineering ☆12 Sep 19, 2023 Updated 2 years ago
- Upstream word list repository for rime-cantonese ☆32 Dec 24, 2025 Updated 2 months ago
- Cantonese word segmentation tool ☆48 Jul 29, 2018 Updated 7 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project ☆16 Oct 28, 2022 Updated 3 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆40 Dec 30, 2020 Updated 5 years ago
- 粵語拼音輸入法下載網站 | Jyutping Input Method Website ☆14 Updated this week
- ☆99 Feb 1, 2024 Updated 2 years ago
- 56 language, 1 model Multilingual ASR ☆24 Jul 25, 2021 Updated 4 years ago
- cantonese-mandarin unsupervised neural translation for sw project ☆28 May 2, 2023 Updated 2 years ago
- List of commonly used Hong Kong characters outside standard character sets (外字) ☆57 Sep 7, 2022 Updated 3 years ago
- ☆10 Jan 20, 2023 Updated 3 years ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool ☆81 Feb 17, 2026 Updated 2 weeks ago
- ☆31 Jun 2, 2018 Updated 7 years ago
- Cantonese pronunciation data collection: source data from reference works ☆232 Feb 27, 2026 Updated last week
- Structural Topic Modeling of the Facebook posts of NC State Senators ☆13 Mar 17, 2017 Updated 8 years ago
- Super Flappy Bird in p5.js ☆10 Mar 8, 2021 Updated 4 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021) ☆11 Oct 24, 2021 Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor… ☆47 May 29, 2023 Updated 2 years ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816 ☆46 May 20, 2021 Updated 4 years ago
- Open Chinese Convert: Simplified-Traditional conversion following the Table of General Standard Chinese Characters ☆14 Feb 19, 2026 Updated 2 weeks ago
- ☆10 Apr 17, 2024 Updated last year
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c… ☆11 Jul 28, 2020 Updated 5 years ago
- ☆10 Jan 3, 2023 Updated 3 years ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation ☆10 May 3, 2022 Updated 3 years ago
- Extended Sensing via Dynamic Tactile Sensors ☆10 Apr 8, 2024 Updated last year
- Phonetically balanced text to speech sentences ☆10 Aug 16, 2021 Updated 4 years ago
- ☆10 Oct 7, 2019 Updated 6 years ago
- For loops in const ☆13 Sep 7, 2024 Updated last year
- char <-> Unicode character name (maintained fork of huonw/unicode_names) ☆12 Sep 7, 2025 Updated 5 months ago
- ☆13 Feb 18, 2023 Updated 3 years ago
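- Evaluation benchmark for Chinese financial large language models: six categories, twenty-five tasks, graded evaluation; domestic (Chinese) models achieve grade A ☆10 May 6, 2024 Updated last year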
- Benchmarks for Business Document Foundation Models ☆10 Apr 4, 2024 Updated last year
- ☆13 Sep 25, 2024 Updated last year
- Github mirror of MediaWiki extension TextExtracts - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Dev… ☆15 Feb 26, 2026 Updated last week