liinnux/awesome-crawler-cn


liinnux / awesome-crawler-cn

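A roundup of web crawlers, spiders, data collectors, and web page parsers. Because new technologies keep developing and new frameworks keep emerging, this document will be updated continually...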

☆333

Alternatives and similar repositories for awesome-crawler-cn

Users who are interested in awesome-crawler-cn are comparing it to the libraries listed below.


dongweiming / daenerys
Scraping and Web Crawling Framework For Zhihu Live
☆63 · Oct 10, 2017 · Updated 8 years ago
dongweiming / Mtime
A spider... ^.^
☆99 · Mar 23, 2014 · Updated 12 years ago
awolfly9 / IPProxyTool
A Python IP proxy tool that crawls with Scrapy. It scrapes large numbers of free proxy IPs and extracts the valid ones for use.
☆2,000 · Dec 8, 2022 · Updated 3 years ago
luyishisi / Anti-Anti-Spider
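More and more websites have anti-crawler features: some hide key data in images, others use user-hostile CAPTCHAs. This repository collects anti-anti-crawler code, building skills by taking on websites with different defenses (with no malicious intent). (Submissions of hard-to-scrape websites are welcome.) (Project paused due to work commitments.)
☆7,285 · Oct 17, 2021 · Updated 4 years ago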
gnemoug / distribute_crawler
A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster provides the underlying storage, Redis handles distribution, and Graphite displays crawler status.
☆3,243 · Apr 18, 2017 · Updated 9 years ago
zorlan / skycaiji
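蓝天采集器 (skycaiji) is a free, open-source crawler system. Data can be collected simply by pointing and clicking to edit rules. It runs locally, on a virtual host, or on a cloud server, can collect almost every type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without login, and is fully automatic with no manual intervention. It is a fully cross-platform cloud crawler system for collecting large-scale web data.
☆2,077 · Updated this week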
lepture / captcha
A captcha library that generates audio and image CAPTCHAs.
☆1,094 · Oct 21, 2025 · Updated 9 months ago
LZC6244 / ip_proxy_pool
A proxy IP pool that uses Django2 as the API backend and Scrapy as the crawler
☆10 · Jun 6, 2020 · Updated 6 years ago
rspivak / slimit
SlimIt - a JavaScript minifier/parser in Python
☆547 · Jul 30, 2019 · Updated 6 years ago
scrapinghub / portia
Visual scraping for Scrapy
☆9,506 · Jun 26, 2024 · Updated 2 years ago
my8100 / scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.…
☆3,409 · Feb 19, 2025 · Updated last year
istresearch / scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on-demand scraping cluster.
☆1,225 · Nov 7, 2023 · Updated 2 years ago
jhao104 / proxy_pool
Python ProxyPool for web spider
☆23,504 · Jun 15, 2026 · Updated last month
SpiderClub / smart_login
Login methods for major websites: some log in via Selenium, others simulate login directly from captured packets (no longer maintained due to limited time)
☆1,009 · Jul 26, 2022 · Updated 3 years ago
suffixbig / PHP-GoogleTranslate
Automated file translation in PHP using the Google Translate API
☆15 · Apr 18, 2019 · Updated 7 years ago
AlexTan-b-z / ZhihuSpider
Distributed Zhihu crawler (Scrapy, Redis)
☆169 · Feb 18, 2018 · Updated 8 years ago
yijingping / unicrawler
A general-purpose, configurable crawler framework
☆543 · Feb 9, 2023 · Updated 3 years ago
cabbage89 / Orchard.Crawler
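An Orchard-based collector: it renders the DOM with the .NET WebBrowser control, then injects jQuery so content can be collected easily with JS, and calls back into C# COM interface methods to store the data and drive the next step.
☆15 · Jul 11, 2016 · Updated 10 years ago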
xchaoinfo / fuck-login
Simulates login on some well-known websites, to make it easier to crawl sites that require login
☆5,870 · Jun 8, 2018 · Updated 8 years ago
defpt / userChromeJs
Scripts for personal use (including ones I wrote and ones modified from other experts' scripts)
☆109 · Apr 18, 2017 · Updated 9 years ago
bowenpay / wechat-spider
WeChat official account crawler
☆3,360 · Aug 10, 2021 · Updated 4 years ago
jobbole / awesome-python-cn
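Chinese edition of the Python resource list, covering web frameworks, web crawlers, template engines, databases, data visualization, image processing, and more; maintained and updated by the WeChat official account teams of 「开源前哨」 and 「Python开发者」.
☆30,502 · Aug 29, 2022 · Updated 3 years ago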
dongweiming / wechat-admin
Wechat Management System
☆1,744 · May 17, 2018 · Updated 8 years ago
Family-TreeSY / SpiderList
Spider Collection
☆23 · Aug 28, 2018 · Updated 7 years ago
rwv / Rising-KaKa
🦁️ Rising KaKa (瑞星小狮子卡卡, the Rising little lion KaKa)
☆22 · Oct 27, 2021 · Updated 4 years ago
gsh199449 / spider
A configurable web spider with an easy-to-use web console
☆999 · Jun 3, 2026 · Updated last month
meetqy / sitemap-nodejs
Node.js crawler that automatically generates a sitemap for a given website
☆12 · Mar 27, 2018 · Updated 8 years ago
laixin86714802 / spider-platform
A visual platform for automated crawler data collection
☆187 · Feb 27, 2023 · Updated 3 years ago
fankcoder / spider-comments
Crawler for iOS game app reviews. Crawls app comments on Amazon and App Annie.
☆12 · Apr 6, 2016 · Updated 10 years ago
qiwsir / awesome-python-cn
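Chinese edition of the Python resource list, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more
☆23 · May 10, 2016 · Updated 10 years ago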
soulmachine / weixinqunzhushou
WeChat group assistant bot
☆15 · Feb 10, 2017 · Updated 9 years ago
Yurunsoft / yurun-crawler-example
Example programs for the Yurun Crawler framework (宇润爬虫框架)
☆16 · Feb 25, 2022 · Updated 4 years ago
chyroc / WechatSogou
WeChat official account crawler API based on Sogou WeChat search
☆6,346 · Mar 7, 2026 · Updated 4 months ago
andeya / pholcus
Pholcus is a distributed high-concurrency crawler software written in pure golang
☆7,578 · Mar 3, 2026 · Updated 4 months ago
FunPanda08 / VLCPlayer_Android
Video Player Ultimate(HD) is based on VLC for Android Beta, and licensed under the GNU General Public License ver3 or later.
☆19 · Jun 15, 2014 · Updated 12 years ago
qiyeboy / IPProxyPool
IPProxyPool, a proxy pool project that provides proxy IPs
☆4,280 · Jul 13, 2018 · Updated 8 years ago
caspartse / QQ-Groups-Spider
QQ Groups Spider (a crawler for QQ groups)
☆866 · Dec 31, 2017 · Updated 8 years ago
rmax / scrapy-redis
Redis-based components for Scrapy.
☆5,645 · May 19, 2026 · Updated 2 months ago
stanzhai / be-a-professional-programmer
Excellent resources, tools, and frameworks used on the road to becoming a professional programmer
☆9,896 · Feb 21, 2023 · Updated 3 years ago