zeicold / pycnnumLinks
Converting Chinese number string <=> int/float/str
☆20Updated 7 months ago
Alternatives and similar repositories for pycnnum
Users that are interested in pycnnum are comparing it to the libraries listed below
Sorting:
- Hanzi Converter for Traditional and Simplified Chinese☆189Updated 5 years ago
- Constants used in Chinese text processing☆378Updated last year
- ☆96Updated last month
- Chinese word segmentation module of LTP☆46Updated 10 years ago
- ☆128Updated 7 years ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆95Updated last year
- Python module that identifies Chinese text as being Simplified or Traditional☆104Updated last year
- Chinese stopwords collection☆138Updated 5 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 9 years ago
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.☆560Updated last year
- Pure python Aho-Corasick library.☆220Updated 2 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 6 years ago
- Python 3 Spelling Corrector☆178Updated 2 years ago
- THU Chinese Keyphrase Extraction Toolkit☆123Updated 7 years ago
- Which Encoding is the Best for Text Classification in Chinese, English, Japanese and Korean?☆174Updated 7 years ago
- A small package to fuzzy match chinese words☆94Updated 2 years ago
- You do not need to modify your model when applied it to Chinese,you can translate chinese chars to wubi ,then you can process chinese cha…☆38Updated 7 years ago
- ICU based universal language tokenizer☆33Updated 3 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Updated 6 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 4 years ago
- Utility scripts or libraries for various Natural Language Processing tasks.☆38Updated 3 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆67Updated 7 years ago
- Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations☆427Updated 2 years ago
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆49Updated 9 years ago
- A Chinese sentiment dataset may be useful for sentiment analysis.☆234Updated 9 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆25Updated 7 years ago
- minitools☆104Updated 12 years ago
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆270Updated 3 weeks ago
- Corpus creator for Chinese Wikipedia☆41Updated 4 years ago
- Conversion of UD_Chinese-GSD to simplified Chinese characters.☆38Updated last month