tsroten / hanzidentifierLinks
Python module that identifies Chinese text as being Simplified or Traditional
☆93Updated 6 months ago
Alternatives and similar repositories for hanzidentifier
Users that are interested in hanzidentifier are comparing it to the libraries listed below
Sorting:
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆91Updated this week
- Hanzi Converter for Traditional and Simplified Chinese☆188Updated 5 years ago
- Constants used in Chinese text processing☆371Updated 5 months ago
- 臺灣閩南語常用詞辭典 資料檔☆78Updated 2 years ago
- A CWN Python binding with graph structure☆31Updated 2 years ago
- Converts between traditional and simplified Chinese☆31Updated 8 months ago
- Identification and conversion functions for Chinese text processing☆59Updated 6 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆91Updated 3 years ago
- 漢語拼音轉換表☆39Updated 4 years ago
- OpenCC made with Python☆555Updated last year
- 台語、族語、客語的語料清單、彙整☆42Updated 5 years ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.☆541Updated last year
- Spoken Cantonese from Hong Kong.☆29Updated 3 weeks ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Updated 9 years ago
- Cantonese Linguistics and NLP☆380Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆65Updated 9 months ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated last year
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆59Updated this week
- ☆93Updated this week
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆40Updated 14 years ago
- Han character library for CJKV languages☆158Updated 4 years ago
- ☆171Updated 2 months ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆171Updated last year
- Export UNIHAN's database to csv, json or yaml☆58Updated this week
- 開放漢語字典 - 現代漢語字音數據庫☆22Updated 4 years ago
- A toolbox for working with the Chinese language in Python☆150Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆40Updated 2 months ago
- Cython wrapper on Hunspell Dictionary☆66Updated 11 months ago
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆68Updated 2 years ago