hermanschaaf / mafanLinks
A toolbox for working with the Chinese language in Python
☆149Updated 5 years ago
Alternatives and similar repositories for mafan
Users that are interested in mafan are comparing it to the libraries listed below
Sorting:
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 13 years ago
- Chinese Words Segment Library based on HMM model☆166Updated 11 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- rmmseg-cpp with Python interface☆189Updated 11 years ago
- Transition-based statistical parser☆417Updated 8 years ago
- Chinese Wordnet v.2☆22Updated 9 years ago
- A simple python script to translate chinese to pinyin based on Mandarin.dat☆217Updated last year
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆162Updated 8 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 12 years ago
- Count frequent n-gram from big data with limited memory.☆60Updated 11 years ago
- EASE (Enhanced AI Scoring Engine) is a library that allows for machine learning based classification of textual content. This is useful …☆218Updated 3 years ago
- Unofficial implementation of the paper "Bag of Tricks for Efficient Text Classification" by Joulin et al.☆60Updated 9 years ago
- Python implementation of linear-chain conditional random fields.☆101Updated 12 years ago
- Stanford NLP group's shared Python tools.☆136Updated 7 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆259Updated 9 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆107Updated 12 years ago
- Japanese Sentiment Analysis☆44Updated 10 years ago
- a chinese segment base on crf☆234Updated 7 years ago
- Pure python NLP toolkit☆55Updated 10 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 9 years ago
- The implementation of Word2Vec (SkipGram - and CBOW) models using theano and numpy☆27Updated 9 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 10 years ago
- Sentiment Analysis with Ensemble☆244Updated 9 years ago
- A toolkit for corpus linguistics☆206Updated 6 years ago
- CogComp's light-weight Python NLP annotators☆115Updated 6 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆189Updated 5 years ago
- A blog post using word embeddings and RNNs to explain representations.☆42Updated 11 years ago
- simple text preprocessing tool☆18Updated 8 years ago
- Code for the ACL-2015 paper "Accurate Linear-Time Chinese Word Segmentation via Embedding Matching"☆38Updated 9 years ago