sunpinyin / open-gram
an open solution for collecting n-gram Chinese lexicon and n-gram statistics
☆74Updated 9 years ago
Alternatives and similar repositories for open-gram:
Users that are interested in open-gram are comparing it to the libraries listed below
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for …☆135Updated 8 years ago
- Utility scripts or libraries for various Natural Language Processing tasks.☆39Updated 3 years ago
- ☆93Updated 5 months ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- ☆36Updated 11 months ago
- 人民日报1998年1-4月中文标注语料库☆32Updated 6 years ago
- Conversion of UD_Chinese-GSD to simplified Chinese characters.☆36Updated 5 months ago
- Chinese word segmentation module of LTP☆46Updated 9 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- [本项目不再维护] 将汉字转换为拼音, 支持多音字,拼音 -> pin yin☆210Updated 2 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆24Updated 6 years ago
- Corpus creator for Chinese Wikipedia☆41Updated 3 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated this week
- Chinese Words Segment Library based on HMM model☆166Updated 10 years ago
- auto generate chinese words in huge text.☆91Updated 10 years ago
- Chinese morphological analysis with Word Segment and POS Tagging data for MeCab☆160Updated 7 years ago
- 绝对有趣的中文发音引擎 funny chinese text to speech enginee☆51Updated 11 years ago
- The zhong [|] Chinese grammars☆14Updated 3 years ago
- Chinese processing☆36Updated 11 years ago
- a chinese segment base on crf☆233Updated 6 years ago
- An Efficient Lexical Analyzer for Chinese☆42Updated 5 years ago
- ☆129Updated 7 years ago
- 一个中文的已标注词性的语料库☆202Updated 10 years ago
- Chinese Natural Language Processing tools and examples☆162Updated 9 years ago
- Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"☆135Updated 4 years ago
- Somiao Pinyin: Train your own Chinese Input Method with Seq2seq Model 搜喵拼音输入法☆266Updated 5 years ago
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆48Updated 8 years ago
- 中文自然语言处理工具包☆86Updated 9 years ago
- A python wrapper around the ZPar parser for English.☆49Updated 4 years ago
- Spelling Corrector for Input Method Engine (IME)☆31Updated 9 years ago