LiuXingMing/SinaSpider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LiuXingMing/SinaSpider)

LiuXingMing / SinaSpider

新浪微博爬虫（Scrapy、Redis）

☆3,286

Alternatives and similar repositories for SinaSpider

Users that are interested in SinaSpider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gnemoug / distribute_crawler
View on GitHub
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
☆3,243Apr 18, 2017Updated 9 years ago
LiuRoy / zhihu_spider
View on GitHub
知乎爬虫
☆1,280Aug 4, 2016Updated 9 years ago
LiuXingMing / QQSpider
View on GitHub
QQ空间爬虫（日志、说说、个人信息）
☆758Nov 25, 2016Updated 9 years ago
taizilongxu / scrapy_jingdong
View on GitHub
用scrapy写的京东爬虫
☆453Dec 5, 2014Updated 11 years ago
Shu-Ji / baidu-music-spider
View on GitHub
百度mp3全站爬虫
☆130Apr 28, 2013Updated 13 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
SpiderClub / weibospider
View on GitHub
A distributed crawler for weibo, building with celery and requests.
☆4,794Jul 11, 2020Updated 6 years ago
chyroc / WechatSogou
View on GitHub
基于搜狗微信搜索的微信公众号爬虫接口
☆6,351Mar 7, 2026Updated 4 months ago
Qutan / Spider
View on GitHub
社交数据爬虫
☆222Oct 11, 2016Updated 9 years ago
lanbing510 / LianJiaSpider
View on GitHub
链家爬虫
☆695Apr 6, 2016Updated 10 years ago
yanzhou / CnkiSpider
View on GitHub
中国知网爬虫
☆663Mar 8, 2025Updated last year
pakoo / tbcrawler
View on GitHub
淘宝天猫商品爬虫
☆266Oct 9, 2013Updated 12 years ago
lanbing510 / DouBanSpider
View on GitHub
豆瓣读书的爬虫
☆2,787Apr 8, 2020Updated 6 years ago
caspartse / QQ-Groups-Spider
View on GitHub
QQ Groups Spider（QQ 群爬虫）
☆866Dec 31, 2017Updated 8 years ago
airingursb / bilibili-user
View on GitHub
🍥 Bilibili 用户爬虫
☆3,089May 2, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bowenpay / wechat-spider
View on GitHub
微信公众号爬虫
☆3,360Aug 10, 2021Updated 4 years ago
fankcoder / findtrip
View on GitHub
机票爬虫（去哪儿和携程网）。flight tickets multiple webspider.(scrapy + selenium + phantomjs + mongodb)
☆488Feb 23, 2026Updated 5 months ago
rmax / scrapy-redis
View on GitHub
Redis-based components for Scrapy.
☆5,644May 19, 2026Updated 2 months ago
LiuXingMing / WeiboSliderCode
View on GitHub
m.weibo.cn登录，四宫格图形解锁验证码破解
☆106Jan 26, 2018Updated 8 years ago
xchaoinfo / fuck-login
View on GitHub
模拟登录一些知名的网站，为了方便爬取需要登录的网站
☆5,869Jun 8, 2018Updated 8 years ago
lucasjinreal / weibo_terminater
View on GitHub
Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator
☆2,320Oct 25, 2019Updated 6 years ago
simapple / spider
View on GitHub
test
☆162Feb 4, 2023Updated 3 years ago
dataabc / weiboSpider
View on GitHub
新浪微博爬虫，用python爬取新浪微博数据
☆9,658Feb 4, 2026Updated 5 months ago
LiuXingMing / Scrapy_Redis_Bloomfilter
View on GitHub
基于Redis的Bloomfilter去重，并将其扩展到Scrapy框架。
☆347Feb 26, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
luyishisi / Anti-Anti-Spider
View on GitHub
越来越多的网站具有反爬虫特性，有的用图片隐藏关键数据，有的使用反人类的验证码，建立反反爬虫的代码仓库，通过与不同特性的网站做斗争（无恶意）提高技术。（欢迎提交难以采集的网站）（因工作原因，项目暂停）
☆7,285Oct 17, 2021Updated 4 years ago
RitterHou / music-163
View on GitHub
爬取网易云音乐所有歌曲的评论数
☆342Feb 16, 2017Updated 9 years ago
gnemoug / sina_reptile
View on GitHub
获取新浪微博1000w用户的基本信息和每个爬取用户最近发表的50条微博,使用python编写，多进程爬取，将数据存储在了mongodb中
☆475Mar 22, 2013Updated 13 years ago
jhao104 / proxy_pool
View on GitHub
Python ProxyPool for web spider
☆23,529Jun 15, 2026Updated last month
LiuXingMing / Tmall1212
View on GitHub
天猫双12爬虫，附商品数据。
☆202Dec 12, 2016Updated 9 years ago
nghuyong / WeiboSpider
View on GitHub
持续维护的新浪微博采集工具🚀🚀🚀
☆4,097Jun 30, 2026Updated 3 weeks ago
qiyeboy / IPProxyPool
View on GitHub
IPProxyPool代理池项目，提供代理ip
☆4,282Jul 13, 2018Updated 8 years ago
geekan / scrapy-examples
View on GitHub
Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.
☆3,252Nov 3, 2023Updated 2 years ago
jackgitgz / CnblogsSpider
View on GitHub
用scrapy采集cnblogs列表页爬虫
☆274Jun 16, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lining0806 / PythonSpiderNotes
View on GitHub
Python入门网络爬虫之精华版
☆7,451Jun 21, 2021Updated 5 years ago
k1995 / BaiduyunSpider
View on GitHub
百度云网盘搜索引擎，包含爬虫 & 网站
☆1,175Sep 16, 2019Updated 6 years ago
binux / pyspider
View on GitHub
A Powerful Spider(Web Crawler) System in Python.
☆16,797Apr 30, 2024Updated 2 years ago
szcf-weiya / SinaSpider
View on GitHub
动态IP解决新浪的反爬虫机制，快速抓取内容。
☆141Sep 10, 2017Updated 8 years ago
benitoro / stockholm
View on GitHub
一个股票数据（沪深）爬虫和选股策略测试框架
☆1,510Aug 14, 2020Updated 5 years ago
hanc00l / wooyun_public
View on GitHub
This repo is archived. Thanks for wooyun! 乌云公开漏洞、知识库爬虫和搜索 crawl and search for wooyun.org public bug(vulnerability) and drops
☆4,402Jul 17, 2019Updated 7 years ago
fxsjy / jieba
View on GitHub
结巴中文分词
☆35,081Aug 21, 2024Updated last year