beifeng600 / nlp_storeroom
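A collection of publicly available NLP resources; some are shared as originally published by others, and some have been lightly processed.
☆55 Updated 9 years ago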
Alternatives and similar repositories for nlp_storeroom:
Users who are interested in nlp_storeroom are comparing it to the repositories listed below
- Chinese-related dictionaries and corpora. ☆169 Updated 10 years ago
- Performance benchmarks of the major Chinese word segmenters ☆155 Updated 5 years ago
- Train Wikidata with word2vec for word embedding tasks ☆122 Updated 6 years ago
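- People's Daily annotated Chinese corpus, January to April 1998 ☆29 Updated 6 years ago
- A Chinese corpus annotated with part-of-speech tags ☆198 Updated 10 years ago
- Compares the pronunciation and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. # Similar characters ☆445 Updated 10 months ago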
- This is a corpus of Chinese abbreviations, including negative full forms. ☆189 Updated 3 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also … ☆66 Updated 6 years ago
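- A Chinese character feature extractor (featurizer), which extracts features of Chinese characters (pronunciation features, glyph features) for use as deep learning features ☆287 Updated 3 years ago
- Word dictionary crawler and format converter for Sogou Input Method (搜狗输入法) lexicons ☆25 Updated 6 years ago
- An attempt at Chinese word segmentation based on deep learning ☆84 Updated 9 years ago
- A Chinese character feature extraction tool that extracts pronunciation features (initial, final, tone), glyph features (components, radicals), four-corner codes, and other features from characters, and can also feed them to a model as tensors ☆130 Updated 4 years ago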
- Somiao Pinyin (搜喵拼音输入法): Train your own Chinese Input Method with Seq2seq Model ☆266 Updated 4 years ago
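- Converts Sogou Pinyin lexicons to txt files ☆51 Updated 7 years ago
- Large-scale Chinese corpus ☆40 Updated 5 years ago
- Chinese, word segmentation, word lists, core dictionaries, event word lists, stop words, sensitive words, Q&A, Q&A data, knowledge graphs, text corpora. ☆149 Updated 3 years ago
- A simple demo of pinyin segmentation and pinyin-to-Chinese conversion (Pinyin2Chinese), based on a hidden Markov model (HMM) and a Trie ☆86 Updated 6 years ago
- WordForm: an interface for stroke decomposition, radical lookup, and pinyin conversion of Chinese words ☆64 Updated 6 years ago
- A word similarity computation method based on the HIT Tongyici Cilin (Extended) thesaurus (哈工大同义词词林扩展版) ☆357 Updated last year
- A Chinese word segmenter based on CRF ☆233 Updated 6 years ago
- A corpus of book titles, including some movie and game titles. ☆68 Updated 10 months ago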
- ☆55 Updated 7 years ago
- Classical Chinese (文言文) dictionary: crawls a Classical Chinese dictionary website to build a Kindle dictionary. ☆65 Updated 6 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT) ☆166 Updated 5 years ago
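- Baidu QA dataset with 1 million entries ☆47 Updated last year
- A Python crawler that downloads lexicon files from the Sogou, Baidu, and QQ input methods; can be used to build vocabularies for different industries ☆112 Updated 7 years ago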
- Chinese Natural Language Processing tools and examples ☆162 Updated 8 years ago
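- Extracts the radicals and pinyin of Chinese characters (the pinyin for some rare characters is incomplete and still to be improved) ☆42 Updated 6 years ago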
- THU Chinese Keyphrase Extraction Toolkit ☆124 Updated 6 years ago
- Chinese Synonyms: a list of Chinese near-synonyms ☆251 Updated 7 years ago