Yanxueshan/Scrapy-Redis-Zhihu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yanxueshan/Scrapy-Redis-Zhihu)

Yanxueshan / Scrapy-Redis-Zhihu

基于scrapy-redis实现分布式爬虫，爬取知乎所有问题及对应的回答，集成selenium模拟登录、英文验证码及倒立文字验证码识别、随机生成User-Agent、IP代理、处理302重定向问题等等

☆61

Alternatives and similar repositories for Scrapy-Redis-Zhihu

Users that are interested in Scrapy-Redis-Zhihu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shikanon / proxy_scrapy
View on GitHub
proxy_scrapy是一个scrapy搭建的代理模块，主要包括代理抓取、代理测试和使用代理三个模块。包括了对主要的代理网站的抓取和代理稳定性的测试，并整合进scrapy爬虫当中。
☆10Jan 20, 2017Updated 9 years ago
dengqiangxi / zhihu_spider
View on GitHub
知乎爬虫，用于爬取用户信息以及用户之间关系。
☆33Nov 22, 2022Updated 3 years ago
inlike / CookiePool
View on GitHub
一个强大的Cookie池项目，融合scrapy/requests/chrome储存cookie/cookie字符串/selenium等cookie形式
☆232Mar 13, 2020Updated 6 years ago
ioiogoo / scrapy-monitor
View on GitHub
scrapy-monitor，实现爬虫可视化，监控实时状态
☆109Dec 26, 2016Updated 9 years ago
Jaysong2012 / tutorial
View on GitHub
Scrapy爬虫实战系列，从零开始爬取腾讯百度淘宝知乎各大网站内容 \n 12306刷票脚本系列
☆80Apr 2, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OSinoooO / MeituanSpider
View on GitHub
美团爬虫，基于scrapy_redis
☆22Apr 1, 2019Updated 7 years ago
Lucareful / JingDongSpider
View on GitHub
基于scrapy框架的京东爬虫实现
☆11Nov 22, 2019Updated 6 years ago
Vanessa219 / vuetify
View on GitHub
Vuetify with the markdown editor
☆10Sep 22, 2019Updated 6 years ago
shisiying / tc_zufang
View on GitHub
使用scrapy,redis, mongodb,django实现的一个分布式网络爬虫,底层存储mongodb,分布式使用redis实现,使用django可视化爬虫
☆280May 1, 2018Updated 8 years ago
yangshimin / ast_study
View on GitHub
记录AST学习
☆49Jan 21, 2022Updated 4 years ago
wzf1997 / count-component-plugin
View on GitHub
统计项目中一个组件引用次数的 webpack 插件
☆12Mar 19, 2022Updated 4 years ago
myml / webssh
View on GitHub
websocket to ssh
☆11May 14, 2019Updated 7 years ago
qhwa / auto_response
View on GitHub
A proxy server for debugging HTTP requests.
☆15Mar 25, 2024Updated 2 years ago
tmliang / Taobao_Spider
View on GitHub
基于Scrapy的Python3分布式淘宝爬虫
☆191Mar 11, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
dagege1993 / scrapy
View on GitHub
1,huaproject算福利吧,爬取的中国校花网,并且保存到本地,基础知识点,url,json,文件的读写. 2,Document.doc 是自己总结的常见爬虫面试题以及答案,但是貌似不想做全职爬虫,所以可能以后也不会更新这一块,爬虫算乐趣, 以后估计重心会放在web …
☆14Jan 24, 2018Updated 8 years ago
doublekous / django_pays
View on GitHub
django框架中使用支付宝支付，微信native模式支付，中国银联支付
☆14Feb 7, 2023Updated 3 years ago
tmpbook / Django-with-ElasticSearch
View on GitHub
当有新的 Blog 被保存时会触发 signals，在 ElasticSearch 中也生成一份并重建索引，最终在 Django 中实现高速查询
☆10Jan 6, 2018Updated 8 years ago
wuyue92tree / crwy
View on GitHub
一个简单的web爬虫框架，借鉴scrapy结构开发而来，并为scrapy使用者提供通用轮子^.^
☆13Nov 9, 2020Updated 5 years ago
xakep666 / asciinema-player
View on GitHub
A simple player for asciinema v2 (https://github.com/asciinema/asciinema) casts
☆20Nov 2, 2023Updated 2 years ago
dengzeyuan / gride-layout
View on GitHub
基于vue的可视化动态更改网格尺寸/可拖拽，可动态改变大小，网格布局和自由布局（vue-gride-layout/dnd-gride）
☆11Jul 20, 2018Updated 8 years ago
LinZiYU1996 / Spring-Boot-Elasticsearch
View on GitHub
☆11May 21, 2018Updated 8 years ago
n7best / react-weui-1
View on GitHub
weui for react
☆10Jul 18, 2017Updated 9 years ago
Johnson0722 / News_scrapy_redis
View on GitHub
☆30Jul 5, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Daoming-Chen / AD_study
View on GitHub
Study notes in Chinese based on Self-Driving Cars (Prof. Andreas Geiger, University of Tübingen)
☆12Aug 4, 2022Updated 3 years ago
Dushibing / log-visual
View on GitHub
日志可视化进阶
☆13May 8, 2017Updated 9 years ago
lipengyu / uudatahive
View on GitHub
蜂巢爬虫系统是一套只需要定义XPath，就可实现爬取网站,APP的系统, 支持多种解析方式（XPath,正则表达式），多种下载方式（HttpClient库, PhantomJs, Selenium）,多种输出方式（Excel，MongoDB）。可不做任何修改发布到Yar…
☆10Sep 5, 2016Updated 9 years ago
Srpihot / GoodsSpider
View on GitHub
电商平台商品自定义爬虫脚本(已完成淘宝,京东)
☆101May 13, 2022Updated 4 years ago
henrylee123 / baiduIndexCrawler
View on GitHub
百度指数（百度热搜爬虫）（js破解版）
☆14Apr 9, 2019Updated 7 years ago
Gerapy / GerapyProxy
View on GitHub
A package for supporting proxy in Scrapy & Gerapy
☆11Jul 15, 2020Updated 6 years ago
pinax / pinax-api
View on GitHub
RESTful API adhering to the JSON:API specification
☆14Apr 19, 2019Updated 7 years ago
PFFFei / rent
View on GitHub
基于Scrapy和Django的二手房爬虫及可视化
☆10Nov 22, 2022Updated 3 years ago
fxyz999 / cnfunds
View on GitHub
基金信息大全
☆14Apr 6, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
WellerQu / grafana_plugins_practice
View on GitHub
grafana插件开发
☆13Jun 7, 2017Updated 9 years ago
Txiz / TBlog-Django
View on GitHub
BootStrap + Django3 + simpleUI 实现的个人博客
☆12Sep 22, 2021Updated 4 years ago
backto17 / SinaHouseCrawler
View on GitHub
基于scrapy,scrapy-redis实现的一个分布式网络爬虫,爬取了新浪房产的楼盘信息及户型图片,实现了常用的爬虫功能需求.
☆40Feb 13, 2017Updated 9 years ago
luobotang / simply-lazy
View on GitHub
A simple Lazy.js implementation, to show the core of lazy evaluation.
☆12Nov 29, 2016Updated 9 years ago
guapier / zufang
View on GitHub
租房爬虫，基于flask，采用apscheduler定时任务，通过微信，定时给用户推送想要的租房信息
☆15Mar 13, 2019Updated 7 years ago
Fankouzu / etherscan-api-cn
View on GitHub
API to cn.etherscan with a simple interface
☆10Mar 12, 2024Updated 2 years ago
Python3WebSpider / ScrapyRedisBloomFilter
View on GitHub
Scrapy Redis Bloom Filter
☆175Jul 25, 2021Updated 4 years ago