immzz/zhihu-scrapy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/immzz/zhihu-scrapy)

immzz / zhihu-scrapy

A scrapy zhihu crawler

☆77

Alternatives and similar repositories for zhihu-scrapy

Users that are interested in zhihu-scrapy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhijunio / scrapy-zhihu-github
View on GitHub
scrapy examples for crawling zhihu and github
☆221Jan 11, 2023Updated 3 years ago
feiskyer / scrapy-examples
View on GitHub
Some scrapy and web.py exmaples
☆79May 20, 2017Updated 9 years ago
maxliaops / scrapy-itzhaopin
View on GitHub
☆94Apr 28, 2014Updated 12 years ago
yoyzhou / weibo_scrapy
View on GitHub
WEIBO_SCRAPY is a Multi-Threading SINA WEIBO data extraction Framework in Python.
☆155Jun 3, 2026Updated last month
Vespa314 / douban_scrapy
View on GitHub
将会陆续添加豆瓣里面各种信息的爬虫代码和分析
☆25Aug 11, 2014Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
widy28 / scrapy-taobao
View on GitHub
scrapy模拟淘宝登陆
☆74Oct 9, 2020Updated 5 years ago
ycloudnet / ya100
View on GitHub
一个比Spark-Parquet还快5~100倍的存储格式
☆12Feb 22, 2016Updated 10 years ago
chenqx / spiderDemo
View on GitHub
☆23Jan 31, 2015Updated 11 years ago
jackgitgz / CnblogsSpider
View on GitHub
用scrapy采集cnblogs列表页爬虫
☆274Jun 16, 2015Updated 11 years ago
pelick / VerticleSearchEngine
View on GitHub
Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, Jquery, Bootstrap, D3, CAS
☆101Jun 16, 2013Updated 13 years ago
brandicted / scrapy-webdriver
View on GitHub
☆143Nov 24, 2015Updated 10 years ago
BruceDone / cnbeta
View on GitHub
一键抓取cnbeta 首页的所有消息
☆16Sep 7, 2016Updated 9 years ago
EricQAQ / Puck
View on GitHub
web micro-frame, quickly developing restful api
☆20Dec 26, 2022Updated 3 years ago
geekan / scrapy-examples
View on GitHub
Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.
☆3,254Nov 3, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
javalurker / sae-flask-blog
View on GitHub
一款运行在SAE Python上采用FLASK开发的轻型博客程序
☆20Aug 23, 2012Updated 13 years ago
xiocode / elasticsearch
View on GitHub
目前生产环境使用的elasticsearch
☆10Apr 29, 2014Updated 12 years ago
MOON-CLJ / scrapy_weibo
View on GitHub
distributed crawler for weibo
☆22May 23, 2013Updated 13 years ago
Cloudxtreme / nginx-cdn
View on GitHub
Nginx Lua = CDN
☆13Jan 18, 2023Updated 3 years ago
leitro / knowsecSpider2
View on GitHub
知道创宇爬虫题目持续更新版本
☆94Nov 6, 2014Updated 11 years ago
gnemoug / distribute_crawler
View on GitHub
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
☆3,242Apr 18, 2017Updated 9 years ago
youyudehexie / simple-scrapy
View on GitHub
simple-scrapy
☆40Jul 6, 2014Updated 12 years ago
YueDayu / WHU_SecCode
View on GitHub
just for fun :)
☆15Jan 13, 2016Updated 10 years ago
wuchong / scrapy-dynamic-configurable
View on GitHub
A dynamic configurable news crawler based Scrapy
☆164Jul 24, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pangge / python-crawler-ccw
View on GitHub
web resources crawler for pdf or doc by python 3
☆25Oct 15, 2014Updated 11 years ago
ghsqt / rocketmq-demo
View on GitHub
超级简单版的两个工程，一个是producer 一个是consumer 。有序和无序的都有了
☆18Mar 11, 2016Updated 10 years ago
smontanaro / spambayes
View on GitHub
SpamBayes spam classifier written in Python
☆19Jun 12, 2023Updated 3 years ago
kohn / HttpProxyMiddleware
View on GitHub
A middleware for scrapy. Used to change HTTP proxy from time to time.
☆323Feb 1, 2018Updated 8 years ago
lnxpgn / scrapy_multiple_spiders
View on GitHub
Using multiple spiders in a Scrapy project
☆10Aug 7, 2015Updated 10 years ago
devuser / spark-notes
View on GitHub
Note anything during writing spark or scala
☆20Sep 29, 2017Updated 8 years ago
pkuWu / Option_Hedge
View on GitHub
这是一个包含Zakamouline和WW两种期权对冲策略的项目
☆18Apr 15, 2022Updated 4 years ago
Germey / ScrapyTutorial
View on GitHub
Scrapy Tutorial
☆11Feb 19, 2017Updated 9 years ago
michaelxs / Android-XRouter
View on GitHub
This is a lightweight and simple routing framework that provides jump routing and method routing.
☆20Jul 18, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lpe234 / meizi_spider
View on GitHub
scrapy demo
☆25Jan 8, 2019Updated 7 years ago
CQiang27 / Spark_Python
View on GitHub
Spark—Python学习笔记
☆11Sep 25, 2018Updated 7 years ago
jelviss / Reptile
View on GitHub
Python火车票信息定时采集
☆14Jul 1, 2022Updated 4 years ago
cuilimeng / SAME
View on GitHub
☆11Apr 13, 2020Updated 6 years ago
Andrew-liu / scrapy_example
View on GitHub
This repository store some example to learn scrapy better
☆175Oct 9, 2020Updated 5 years ago
tpeng / weibosearch
View on GitHub
A distributed Sina Weibo Search spider base on Scrapy and Redis.
☆146May 31, 2013Updated 13 years ago
scrapy-plugins / scrapy-jsonrpc
View on GitHub
Scrapy extension to control spiders using JSON-RPC
☆299Aug 26, 2019Updated 6 years ago