lxw0109 / CJOSpider
A spider (with and without Scrapy) for crawling data from China Judgements Online (中国裁判文书网).
☆20 Updated 6 years ago
Alternatives and similar repositories for CJOSpider
Users who are interested in CJOSpider are comparing it to the repositories listed below
- Chinese named entity recognition (company names), TensorFlow 1.3 + Python 3 ☆38 Updated 7 years ago
- Self-implemented WeiboIndexSpyder based on Selenium; collects the Sina Weibo Index (微指数), including the overall index, mobile index, and PC index ☆31 Updated 6 years ago
- DeepDive Tutorial with Chinese Support ☆34 Updated 3 years ago
- Obtaining an offline Wikipedia corpus ☆28 Updated 7 years ago
- For personal study only. Please star or fork the original author's repository. ☆27 Updated 9 years ago
- Syntax- and rule-based document sentiment analysis: a demo of document-level sentiment analysis based on dependency-syntax rules ☆107 Updated 5 years ago
- Computes the similarity of Chinese texts using TF feature vectors and simhash fingerprints ☆216 Updated 8 years ago
- Evaluation of Chinese word segmentation tools ☆61 Updated 2 years ago
- Crawls summary-title pairs of news articles from portal sites and uses seq2seq to generate titles from summaries ☆45 Updated 7 years ago
- Intelligent customer service ☆105 Updated 5 years ago
- Annotations of the Chinese word segmenter jieba (Python version) ☆92 Updated 6 years ago
- New word discovery ☆66 Updated 10 years ago
- New word discovery algorithm (NewWordDetection) ☆92 Updated 4 years ago
- Self-implemented BaiduIndexSpyder based on Selenium, with index image decoding and number image conversion; automatically collects historical Baidu search index data by keyword ☆41 Updated 6 years ago
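- A simple trie implemented in Python that supports adding, looking up, and deleting keywords, for keyword matching, stop-word removal, and similar tasks on Chinese text. ☆64 Updated 5 years ago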
- Event monitor based on an online news corpus, including event storyline and analysis: given event keywords, it collects news about the event, then mines and analyzes it. ☆152 Updated 6 years ago
- Separates the hanLP part of the earlier hanLP-python-flask into its own project ☆59 Updated 7 years ago
- Unsupervised Automatic Generation of Chinese Fake Reviews. ☆84 Updated 5 years ago
- Train Wikidata with word2vec for word embedding tasks ☆123 Updated 6 years ago
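- xmnlp, a Chinese word segmentation tool written in Java that combines statistical (probabilistic) and rule-based segmentation. Features include person-name recognition, part-of-speech tagging, and user-defined dictionary extension; its segmentation quality and speed are both described as exceeding those of the open-source jieba segmenter. ☆36 Updated 3 years ago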
- A simple and useful platform for entity tagging using tornado. ☆25 Updated 5 years ago
- Deduplicates massive text collections using Simhash ☆12 Updated 6 years ago
- ☆31 Updated 6 years ago
- Wrapper library (SDK) for the BosonNLP HTTP API ☆163 Updated 6 years ago
- ZhidaoChatbot, a chatbot that can be an expert on common questions like why, how, when, who, and what, based on the online question-answer web… ☆42 Updated 6 years ago
- DeepDive configuration for Chinese ☆51 Updated 8 years ago
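- A concise Java implementation of maximum entropy that provides training and prediction interfaces. Training uses the GIS algorithm; a sample training set and a weather-prediction demo are included. ☆46 Updated 10 years ago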
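- A personally implemented language computing experiment platform based on Django and semantic-ui. Features include general natural language processing, word computation, social hot-topic computation, person computation, literary profiling, job profiling, and other social computing functions ☆29 Updated 7 years ago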
- E-Commerce Sentiment Dict ☆130 Updated 6 years ago
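- This project once ranked first globally; a collection of highlights is at the very bottom of this page. The complete, polished print edition, 《编程之法:面试和算法心得》, is on sale at JD.com and Dangdang ☆40 Updated 7 years ago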