LiveMirror / jiebaLinks
结巴中文分词做最好的Python分词组件
☆31Updated 8 years ago
Alternatives and similar repositories for jieba
Users that are interested in jieba are comparing it to the libraries listed below
Sorting:
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆24Updated 6 years ago
- NLP的一些公开资料,有些是别人原始分享的,有些是处理了一下。☆57Updated 9 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated last year
- Corpus creator for Chinese Wikipedia☆41Updated 3 years ago
- A Public Corpus for Machine Learning☆44Updated 6 years ago
- PTT 八卦版問答中文語料☆243Updated 7 months ago
- 中文 NLP 语料库数据集☆20Updated 6 years ago
- 中文情緒分析☆51Updated 10 years ago
- A Python Wrapper of Stanford Chinese Segmenter☆20Updated 7 years ago
- 总结了一些可以用作聊天机器人训练实作的文字语聊,包含中英文不同语言☆118Updated 7 years ago
- Chinese stopwords collection☆135Updated 5 years ago
- Self complemented sentiment words expansion using seed sentiment words and so-pmi , this method is tested to be effective, 基于情感种子词与so-pmi…☆87Updated 7 years ago
- 公開的情緒訓練資料☆58Updated 2 years ago
- 基于TextRank和WordNet的中英文单文档自动摘要☆63Updated 9 years ago
- Pre-trained Wikipedia corpus by MITIE☆51Updated 6 years ago
- 金庸小说人物关系图谱构建☆61Updated 5 years ago
- 对中文分词jieba (python版)的注解☆92Updated 6 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆30Updated 4 years ago
- 常用的中文停用词表☆76Updated 7 years ago
- SMP2017中文人机对话评测数据☆107Updated 7 years ago
- 中文詞向量訓練教學☆516Updated 2 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆67Updated 7 years ago
- 会说中文的机器人☆55Updated 13 years ago
- code collections for the book of qna☆28Updated 6 years ago
- Tutorial for Chinese Sentiment analysis with hotel review data☆48Updated 7 years ago
- 金融问答平台文本数据采集/爬取,数据源涉及上交所,深交所,全景网及新浪股吧☆38Updated 7 years ago
- 中文PDF转TXT的实用工具☆30Updated 3 years ago
- Event monitor based on online news corpus including event storyline and analysis,基于给定事件关键词,采集事件资讯,对事件进行挖掘和分析。☆152Updated 6 years ago
- A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。☆32Updated 2 years ago
- 使用pyltp的工具,基于中文依存句法的四大名著人物情节分析系统。分为整体分析和章节分析两大模块,实现了人物篇幅分析,故事发生地分析,主要人物情绪变化分析,人物互动情况分析.☆94Updated 7 years ago