KeithYue/Zhihu_Spider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KeithYue/Zhihu_Spider)

KeithYue / Zhihu_Spider

Scrapy the Zhihu content and user social network information

☆46

Alternatives and similar repositories for Zhihu_Spider

Users that are interested in Zhihu_Spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

immzz / zhihu-scrapy
View on GitHub
A scrapy zhihu crawler
☆77Nov 6, 2018Updated 7 years ago
zhijunio / scrapy-zhihu-github
View on GitHub
scrapy examples for crawling zhihu and github
☆221Jan 11, 2023Updated 3 years ago
pelick / VerticleSearchEngine
View on GitHub
Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, Jquery, Bootstrap, D3, CAS
☆101Jun 16, 2013Updated 13 years ago
KeithYue / weibo-keywords-crawler
View on GitHub
Crawl the related sina weibo content using the keywords, and save the results to txt file for future use.
☆18Oct 20, 2016Updated 9 years ago
ml-distribution / phrase-finding
View on GitHub
新词发现分布式机器学习算法。
☆15Jul 21, 2014Updated 12 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
weizetao / spider-roach
View on GitHub
分布式定向抓取集群
☆71Sep 4, 2017Updated 8 years ago
bojanliu / zhihu-to-renren
View on GitHub
一个自动抓取知乎热门问答内容、自动在人人网上发日志的脚本
☆40May 27, 2012Updated 14 years ago
yoyzhou / weibo_scrapy
View on GitHub
WEIBO_SCRAPY is a Multi-Threading SINA WEIBO data extraction Framework in Python.
☆155Jun 3, 2026Updated last month
Darmody / getLike
View on GitHub
一个 python scrapy 爬虫 utility，定制任何我想抓取的web infomation！
☆12Apr 7, 2014Updated 12 years ago
zhchbin / KM
View on GitHub
My personal knowledge management.
☆18Sep 26, 2014Updated 11 years ago
younghz / scrapy-redis
View on GitHub
Redis-based components for scrapy that allows distributed crawling
☆46Sep 6, 2014Updated 11 years ago
phyng / scrapy-stats
View on GitHub
Scrapy项目，抓取国家统计局区划代码，并用D3.js可视化
☆47Aug 22, 2014Updated 11 years ago
REMitchell / data-day-seattle
View on GitHub
Sample Crawler for Data Day Seattle
☆10Jun 27, 2015Updated 11 years ago
wuchong / scrapy-dynamic-configurable
View on GitHub
A dynamic configurable news crawler based Scrapy
☆164Jul 24, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JanHuang / awesome-me
View on GitHub
my php learn plan.
☆15Jul 18, 2017Updated 9 years ago
multiangle / Distributed_Microblog_Spider
View on GitHub
分布式新浪微博爬虫
☆30Dec 13, 2016Updated 9 years ago
poldrack / python
View on GitHub
python code for data processing
☆18May 15, 2013Updated 13 years ago
mayJJ / ket
View on GitHub
☆12Aug 5, 2018Updated 7 years ago
ehsanmok / sparkling-titanic
View on GitHub
Training models with Apache Spark, PySpark for Titanic Kaggle competition
☆14Sep 23, 2016Updated 9 years ago
DaveRandom / libjit
View on GitHub
Mirror of libjit http://www.gnu.org/software/libjit/
☆16Sep 15, 2014Updated 11 years ago
junwei-pan / Allen_AI_Science_Challenge_JunweiPan
View on GitHub
☆16Feb 3, 2016Updated 10 years ago
Andrew-liu / scrapy_example
View on GitHub
This repository store some example to learn scrapy better
☆175Oct 9, 2020Updated 5 years ago
paulu / deepmanifold
View on GitHub
Deep Manifold Traversal
☆14Nov 14, 2016Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
python-cn / flask-slackbot
View on GitHub
flask_slackbot helps you deal with slack outgoing webhook.
☆22Jun 24, 2015Updated 11 years ago
AimeeHu / cracking-the-coding-interview-python-solution
View on GitHub
Python answers for the book Cracking the Coding Interview
☆17Dec 12, 2013Updated 12 years ago
sdq / FenciMac
View on GitHub
中文分词 Mac版
☆10Jul 5, 2017Updated 9 years ago
MOON-CLJ / scrapy_weibo
View on GitHub
distributed crawler for weibo
☆22May 23, 2013Updated 13 years ago
johnarevalo / blocks-char-rnn
View on GitHub
Multi-layer RNN (LSTM, GRU, RNN) for character-level language models in Blocks
☆60Jun 25, 2016Updated 10 years ago
PureMVC / puremvc-php-multicore-framework
View on GitHub
PureMVC MultiCore Framework for PHP
☆12Oct 27, 2018Updated 7 years ago
mindjolt / starling-builder-extensions
View on GitHub
☆12Sep 29, 2016Updated 9 years ago
AbeHandler / WordNet-Word2Vec
View on GitHub
An empirical comparison of lexical relations in WordNet and word2vec
☆28Apr 8, 2021Updated 5 years ago
thallium205 / Bitcoin_Updater
View on GitHub
Stores and updates the bitcoin blockchain and historical bitcoin market data into a mysql database.
☆17Feb 21, 2012Updated 14 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gnemoug / assessment
View on GitHub
这是一个使用bottle，mongodb和jinja2开发的一个同学互评系统，通过它进行了对于使用bottle进行web开发的探索，包括：bottle做web开发的物理设计和bottle做web开发的高级的特性的使用
☆20Aug 26, 2013Updated 12 years ago
Andrew-liu / dou_ban_spider
View on GitHub
A Simple spider that use to crawl the douban Top 100 moive name and input all list
☆133May 24, 2017Updated 9 years ago
junrushao / Final-Fanatic-Facility
View on GitHub
A C compiler with SSA-based backend optimzation
☆15Mar 19, 2016Updated 10 years ago
raingo / caffe-parameter-server
View on GitHub
Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…
☆13May 7, 2015Updated 11 years ago
dakotahp / RegExhibit
View on GitHub
Source for Roger Jolly's RegExhibit.
☆14Mar 9, 2012Updated 14 years ago
pangge / python-crawler-ccw
View on GitHub
web resources crawler for pdf or doc by python 3
☆25Oct 15, 2014Updated 11 years ago
wewoor / fm_citypatient
View on GitHub
一个简单的，小型的基于Nodejs + HTML5 + Angularjs的FM网站
☆16May 9, 2015Updated 11 years ago