tsroten / hanzidentifier
Python module that identifies Chinese text as being Simplified or Traditional
☆89Updated 3 months ago
Alternatives and similar repositories for hanzidentifier:
Users that are interested in hanzidentifier are comparing it to the libraries listed below
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆89Updated this week
- 臺灣閩南語常用詞辭典 資料檔☆76Updated last year
- Constants used in Chinese text processing☆368Updated 2 months ago
- Hanzi Converter for Traditional and Simplified Chinese☆184Updated 4 years ago
- Export UNIHAN's database to csv, json or yaml☆54Updated this week
- Identification and conversion functions for Chinese text processing☆59Updated 3 months ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆86Updated 3 years ago
- 台語、族語、客語的語料清單、彙整☆39Updated 4 years ago
- A CWN Python binding with graph structure☆27Updated last year
- OpenCC made with Python☆545Updated last year
- Cantonese Linguistics and NLP☆367Updated 8 months ago
- Han character library for CJKV languages☆153Updated 3 years ago
- 中華大辭典☆117Updated last year
- 教育部重編國語辭典 資料檔; 若有建議或 bug 請在 moedict-process 反應☆137Updated 2 years ago
- Spoken Cantonese from Hong Kong.☆29Updated 3 months ago
- ☆33Updated 8 months ago
- 粵文語料篩選器 Cantonese text filter☆37Updated last week
- 開放漢語字典 - 現代漢語字音數據庫☆21Updated 4 years ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆57Updated 5 months ago
- A toolbox for working with the Chinese language in Python☆147Updated 5 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆65Updated 3 months ago
- Machine-Translation-based sentence alignment tool for parallel text☆306Updated 3 years ago
- ☆92Updated 3 months ago
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆61Updated last month
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- 漢語拼音轉換表☆36Updated 3 years ago
- Converts between traditional and simplified Chinese☆30Updated 5 months ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆52Updated 11 months ago