Dengqlbq / ZhiHuSpider

A crawler for Zhihu questions and answers

☆25

Alternatives and similar repositories for ZhiHuSpider

Users who are interested in ZhiHuSpider are comparing it to the repositories listed below

Google1234 / Information_retrieva_Projectl-
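News retrieval: a crawler performs targeted collection of 3-4 web pages and implements extraction, retrieval, and indexing of page information. The number of pages is no fewer than 10; results can be sorted by time, relevance, popularity, and other attributes, with automatic clustering of similar topics. Supported features: related-search suggestions, snippet generation, and result preview (hovering over a result shows a preview).
☆128 Updated 9 years ago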
ashora / SocialListening
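Based on a rule system designed at the Chinese University of Hong Kong: an initial keyword lexicon is built from a small sample of reviews, then each review is matched against 18 sentence patterns to quickly and accurately identify the review target and sentiment polarity. After several iterations of refining the keyword lexicon to reach relatively high accuracy, Tableau is used to analyze the data further, identifying the product attributes customers focus on most and the attributes that are commonly praised or criticized; through…
☆56 Updated 8 years ago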
xqtbox / AutoHomeSpider_Scrapy
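Crawls word-of-mouth review data from Autohome (汽车之家), with an analysis of how to get around its front-end JavaScript anti-crawler measures
☆62 Updated 8 years ago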
EliasCai / sentiment
CCF big data competition: topic-based text sentiment analysis
☆94 Updated 7 years ago
chaoming0625 / WaiMaiOpinionMiner
Fine-grained sentiment analysis, repository 1: Wai Mai Opinion Miner, a GUI demo of a fine-grained sentiment analysis system.
☆113 Updated 9 years ago
qibinlou / SinaWeibo-Emotion-Classification
Sina Weibo sentiment analysis application
☆143 Updated 9 years ago
sileixinhua / News-classification
News classification system and rumor handling system
☆79 Updated 8 years ago
speciallurain / CNKI_Patent_SVM
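Text classification is the process of automatically determining a text's category from its content under a given classification scheme. First, a scrapy crawler exploiting the URL patterns of CNKI (中国知网) collects more than 700,000 invention patents published in 2014; data cleaning then filters these down to more than 600,000 labeled records. TF-IDF is used to extract term frequencies from the 600,000-plus texts, and, sorted by term frequency, extracts…
☆108 Updated 7 years ago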
howie6879 / getNews
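Internet news recommendation system (myNews): an entry for the enterprise-proposed topic track of the 2016 National Computer Design Competition
☆45 Updated 8 years ago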
691505789 / cnn-text-classification
Code for a paper on sentiment analysis based on convolutional neural network parameter optimization
☆61 Updated 7 years ago
ZexinYan / NLP-JD
A program that tries to classify the sentiment of product comments on JD.
☆39 Updated 8 years ago
lrcUnlimited / check_file_system
Large-scale content deduplication using the simhash algorithm
☆14 Updated 9 years ago
zlikun / python-crawler-douban-movie
Crawler for Douban Movie (short reviews)
☆52 Updated 7 years ago
QuantumLiu / Neural-Headline-Generator-CN
Crawls summary-headline pairs of news from portal sites and uses seq2seq to generate headlines from summaries
☆45 Updated 8 years ago
mattzheng / LangueOne
Exercise: text mining on open-source data from Toutiao (今日头条)
☆84 Updated 6 years ago
youthpasses / bayes_classifier
Text classification (news classification) implemented with naive Bayes
☆65 Updated 9 years ago
zpeng1989 / RNN_learning_text_code
A Char RNN implementation based on the latest version of TensorFlow. It can generate English text, poetry, song lyrics, novels, code, Japanese text, and more.
☆43 Updated 7 years ago
nladuo / novelRS
A simple recommendation system for web novels.
☆126 Updated 6 years ago
Glacier759 / Sentiment
Chinese text sentiment classification based on a sentiment lexicon and the naive Bayes algorithm
☆84 Updated 11 years ago
zhangxinxing / cluster_for_weibo_data
An implementation of topic clustering for Weibo
☆49 Updated 9 years ago
HappyShadowWalker / ChineseTextClassify
Chinese text classification using the Sogou text classification corpus
☆125 Updated 9 years ago
Germey / SentenceDistance
Sentence Distance
☆55 Updated 7 years ago
zhbbupt / TF_IDF
The TF_IDF algorithm implemented in Python, for document relevance search
☆36 Updated 10 years ago
YijunRan / Opinion-leaders-mining
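This work proposes a method for mining opinion leaders in QQ groups based on reply relationships. The method first builds a lexicon of response words, then uses the Aho-Corasick algorithm to match response words in the chat text and construct a network of user reply relationships, and finally applies methods for identifying important nodes in social networks to find the opinion leaders. The method achieves relatively high accuracy in discovering opinion leaders in QQ groups…
☆21 Updated 9 years ago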
lining0806 / Naive-Bayes-Classifier
Naive Bayes text classifier
☆142 Updated 9 years ago
liuhuanyong / BaiduIndexSpyder
Self-implemented BaiduIndexSpyder based on Selenium, with index image decoding and number image conversion; automatic collection of historical Baidu search index data by keyword
☆42 Updated 7 years ago
renjunxiang / chatbot_by_similarity
A question-answering chatbot based on text similarity (simple version)
☆52 Updated 7 years ago
liuhuanyong / WeiboIndexSpyder
Self-implemented WeiboIndexSpyder based on Selenium; collects the Sina Weibo index (micro index), including the overall index, mobile index, and PC index
☆31 Updated 7 years ago
sysuLocas / Single-pass-python-implement
A Python implementation of a news text clustering algorithm for discovering hotly discussed events
☆36 Updated 9 years ago
jfzhang95 / news_spider
News crawler (Tencent, NetEase, Sina, Toutiao, Sohu, ifeng.com, Tencent rolling news)
☆58 Updated 7 years ago