zeicold / pycnnum
Converting Chinese number string <=> int/float/str
☆20Updated 8 months ago
Alternatives and similar repositories for pycnnum
Users that are interested in pycnnum are comparing it to the libraries listed below
- Constants used in Chinese text processing☆378Updated last year
- Hanzi Converter for Traditional and Simplified Chinese☆189Updated 5 years ago
- ☆96Updated last month
- ☆129Updated 8 years ago
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.☆560Updated last year
- Python module that identifies Chinese text as being Simplified or Traditional☆105Updated last year
- Pure python Aho-Corasick library.☆220Updated 2 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 6 years ago
- You do not need to modify your model when applying it to Chinese: you can translate Chinese characters to Wubi, then you can process Chinese cha…☆38Updated 7 years ago
- Chinese word segmentation module of LTP☆46Updated 10 years ago
- Chinese stopwords collection☆140Updated 5 years ago
- Chinese tokenizer benchmark☆25Updated 7 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 9 years ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- Utility scripts or libraries for various Natural Language Processing tasks.☆37Updated 3 years ago
- super fast cpp implementation of longest common subsequence/substring☆72Updated 2 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆66Updated 7 years ago
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆49Updated 9 years ago
- ☆176Updated 9 months ago
- A Chinese sentiment dataset may be useful for sentiment analysis.☆234Updated 9 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Updated 6 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆51Updated 7 years ago
- Python 3 Spelling Corrector☆178Updated 2 years ago
- Corpus of Annotations for Misspellings☆28Updated 2 years ago
- Simhash and near-duplicate detection☆421Updated 2 years ago
- THU Chinese Keyphrase Extraction Toolkit☆123Updated 7 years ago
- A web-based viewer for documents in the CoNLL-U format☆16Updated 4 years ago
- CRF++: Yet Another CRF toolkit☆513Updated 10 months ago
- CoNLL-U format library for JavaScript☆73Updated 8 years ago