BrikerMan / classic_chinese_punctuate
classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset
☆34Updated 2 years ago
Alternatives and similar repositories for classic_chinese_punctuate:
Users that are interested in classic_chinese_punctuate are comparing it to the libraries listed below
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- 中文分词工具评估☆61Updated 2 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆194Updated 3 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆301Updated 4 years ago
- 教育行业新闻 自动文摘 语料库 自动摘要☆198Updated 6 years ago
- 中文相关词典和语料库。☆172Updated 10 years ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 5 years ago
- 图书名语料库。含部分电影、游戏名称。☆71Updated last year
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口☆59Updated 6 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆51Updated 6 years ago
- ☆92Updated 4 months ago
- 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆278Updated last year
- Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"☆135Updated 3 years ago
- Train Wikidata with word2vec for word embedding tasks☆122Updated 6 years ago
- THU Chinese Keyphrase Extraction Toolkit☆125Updated 6 years ago
- NLP NER datasets video/music/book bio☆88Updated 4 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆134Updated 4 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 6 years ago
- SemEval-2016 Task 9: Chinese Semantic Dependency Parsing☆135Updated 6 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆66Updated 6 years ago
- ☆42Updated 6 years ago
- 中文文本自动纠错☆84Updated 6 years ago
- 人民日报1998年1-4月中文标注语料库☆30Updated 6 years ago
- State of the art Chinese Word Segmentation with Bi-LSTMs☆27Updated 4 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆82Updated 2 years ago
- Conversion of UD_Chinese-GSD to simplified Chinese characters.☆36Updated 4 months ago
- 物种名称语料库。植物名,动物名。☆48Updated last year
- ☆28Updated 4 months ago