hermanschaaf / mafan
A toolbox for working with the Chinese language in Python
☆147Updated 5 years ago
Alternatives and similar repositories for mafan:
Users that are interested in mafan are comparing it to the libraries listed below
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆242Updated 12 years ago
- OpenCC binding for Python.☆52Updated 4 years ago
- rmmseg-cpp with Python interface☆189Updated 10 years ago
- the Chinese NLP full stack toolkit☆41Updated 10 years ago
- A deep learning Chinese Word Segmentation toolkit☆45Updated 7 years ago
- A simple python script to translate chinese to pinyin based on Mandarin.dat☆214Updated 11 months ago
- Constants used in Chinese text processing☆368Updated 2 months ago
- 中文自然语言处理工具包☆86Updated 9 years ago
- ☆92Updated 3 months ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 8 years ago
- Transition-based statistical parser☆416Updated 7 years ago
- Use word2vec to improve search result☆64Updated 8 years ago
- Count frequent n-gram from big data with limited memory.☆59Updated 11 years ago
- Chinese Words Segment Library based on HMM model☆167Updated 10 years ago
- Chinese segmentation library☆82Updated 13 years ago
- Sentiment Analysis with Ensemble☆244Updated 8 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆157Updated 7 years ago
- Stanford NLP group's shared Python tools.☆137Updated 6 years ago
- Chinese Word Similarity Computation based on HowNet☆27Updated 7 years ago
- A pure python3 implementation of CRF tagger☆20Updated 6 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆184Updated 4 years ago
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆73Updated 9 years ago
- Lean Semantic Web tutorials☆128Updated 11 years ago
- Recurrent Neural Networks(GRU) for character-level language models on Chinese, in Python/Theano☆63Updated 7 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆95Updated 8 years ago
- Chinese word segmentation module of LTP☆46Updated 9 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- Code for the ACL-2015 paper "Accurate Linear-Time Chinese Word Segmentation via Embedding Matching"☆38Updated 9 years ago
- Code for Exploring Segment Representations for Neural Segmentation Models