sunpinyin / open-gram
an open solution for collecting n-gram Chinese lexicon and n-gram statistics
☆73Updated 9 years ago
Alternatives and similar repositories for open-gram
Users who are interested in open-gram are comparing it to the libraries listed below.
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 9 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆161Updated 8 years ago
- OpenCC binding for Python.☆52Updated 5 years ago
- An Efficient Lexical Analyzer for Chinese☆44Updated 5 years ago
- Constants used in Chinese text processing☆377Updated 10 months ago
- [This project is no longer maintained] Converts Chinese characters to pinyin, with support for polyphonic characters; 拼音 -> pin yin☆211Updated 5 months ago
- Chinese word segmentation module of LTP☆46Updated 10 years ago
- A toolbox for working with the Chinese language in Python☆150Updated 5 years ago
- ☆96Updated last month
- Automatically generate Chinese words from huge text.☆92Updated 10 years ago
- Can CNNs transliterate Pinyin into Chinese characters correctly?☆333Updated 7 years ago
- Somiao Pinyin: Train your own Chinese Input Method with Seq2seq Model 搜喵拼音输入法☆271Updated 5 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆191Updated 5 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆303Updated 5 years ago
- A Chinese word segmenter based on CRF☆234Updated 6 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated 3 weeks ago
- Chinese Words Segment Library based on HMM model☆166Updated 11 years ago
- Software for unsupervised word segmentation and language model learning using lattices☆45Updated 9 years ago
- Chinese natural language processing toolkit☆86Updated 10 years ago
- A Chinese corpus annotated with part-of-speech tags☆206Updated 11 years ago
- ☆128Updated 7 years ago
- Utility scripts or libraries for various Natural Language Processing tasks.☆38Updated 3 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆172Updated 6 years ago
- Spelling Corrector for Input Method Engine (IME)☆31Updated 9 years ago
- Chinese word segmentation algorithm without corpus☆499Updated 5 years ago
- A simple python script to translate chinese to pinyin based on Mandarin.dat☆218Updated last year
- Spoken Cantonese from Hong Kong.☆30Updated 3 weeks ago
- MIT Language Modeling Toolkit☆117Updated 5 years ago