liinnux / awesome-crawler-cnView external linksLinks
互联网爬虫,蜘蛛,数据采集器,网页解析器的汇总,因新技术不断发展,新框架层出不穷,此文会不断更新...
☆331Oct 7, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-crawler-cn
Users that are interested in awesome-crawler-cn are comparing it to the libraries listed below
Sorting:
- Scraping and Web Crawling Framework For Zhihu Live☆63Oct 10, 2017Updated 8 years ago
- 越来越多的网站具有反爬虫特性,有的用图片隐藏关键数据,有的使用反人类的验证码,建立反反爬虫的代码仓库,通过与不同特性的网站做斗争(无恶意)提高技术。(欢迎提交难以采集的网站)(因工作原因,项目暂停)☆7,309Oct 17, 2021Updated 4 years ago
- 这个项目是蜘蛛项目 ShoppingWebCrawler 的可视化任务站点。☆10Oct 16, 2018Updated 7 years ago
- python ip proxy tool scrapy crawl. 抓取大量免费代理 ip,提取有效 ip 使用☆2,003Dec 8, 2022Updated 3 years ago
- 使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现☆3,253Apr 18, 2017Updated 8 years ago
- 蓝天采集器是一款开源免费的爬虫系统,仅需点选编辑规则即可采集数据,可运行在本地、虚拟主机或云服务器中,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统☆2,058Feb 1, 2026Updated 2 weeks ago
- Scrapy extension to write scraped items using Django models☆505Oct 15, 2023Updated 2 years ago
- 🇨🇳翻译: <awesome-puppeteer> Puppeteer 资源的精选列表 ❤️ 校对 ✅☆23Mar 29, 2019Updated 6 years ago
- Sprint Planning / Scrum Poker online tool (Akka/Socko Websockets)☆19Dec 22, 2015Updated 10 years ago
- A Powerful Spider(Web Crawler) System in Python.☆17,044Apr 30, 2024Updated last year
- Visual scraping for Scrapy☆9,490Jun 26, 2024Updated last year
- 各大网站登陆方式,有的是通过selenium登录,有的是通过抓包直接模拟登录(精力原因,目前不再继续维护)☆1,010Jul 26, 2022Updated 3 years ago
- nodejs爬虫,输入网站自动生成网站sitemap☆12Mar 27, 2018Updated 7 years ago
- ios游戏APP评论爬虫。crawl app comments on amazon && appannie.☆12Apr 6, 2016Updated 9 years ago
- 仿iOS的PickerView控件,有时间选择和选项选择并支持一二三级联动效果☆11Oct 12, 2015Updated 10 years ago
- 知乎分布式爬虫(Scrapy、Redis)☆168Feb 18, 2018Updated 7 years ago
- This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.☆1,230Nov 7, 2023Updated 2 years ago
- Just a DEMO to demonstrate how to use JNA to type chars into alipay's password edit control automatically.☆12Dec 21, 2017Updated 8 years ago
- 微信群聊天监控机器人☆14Sep 3, 2020Updated 5 years ago
- Just a blog☆23Jul 31, 2017Updated 8 years ago
- Specifically designed to solve the web crawler when collecting Internet web data who need to login the web-site by useing some Simulated…☆14Nov 30, 2016Updated 9 years ago
- PHP使用Google Translate API來做自動化檔案翻譯☆15Apr 18, 2019Updated 6 years ago
- 一个通用的可配置的爬虫框架☆544Feb 9, 2023Updated 3 years ago
- 微信公众号爬虫☆3,298Aug 10, 2021Updated 4 years ago
- 微信群助手机器人☆15Feb 10, 2017Updated 9 years ago
- Video Player Ultimate(HD) is based on VLC for Android Beta, and licensed under the GNU General Public License ver3 or later.☆19Jun 15, 2014Updated 11 years ago
- A captcha library that generates audio and image CAPTCHAs.☆1,088Oct 21, 2025Updated 3 months ago
- Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台☆230Feb 9, 2023Updated 3 years ago
- Wechat Management System☆1,751May 17, 2018Updated 7 years ago
- scrapy-redis代码研究☆14Oct 10, 2014Updated 11 years ago
- Ning Guo's blog☆16Jan 26, 2026Updated 3 weeks ago
- 模拟登录一些知名的网站,为了方便爬取需要登录的网站☆5,893Jun 8, 2018Updated 7 years ago
- Python资源大全中文版,包括:Web框架、网络爬虫、模板引擎、数据库、数据可视化、图片处理等,由「开源前哨」和「Python开发者」微信公号团队维护更新。☆30,205Aug 29, 2022Updated 3 years ago
- ☆61Jan 6, 2017Updated 9 years ago
- 🦁️ Rising KaKa 瑞星小狮子卡卡☆19Oct 27, 2021Updated 4 years ago
- A nodejs-spider that gets the needed information of top-ten in bbs.byr.cn☆16May 26, 2016Updated 9 years ago
- 基于Orchard的一个采集器,使用.NET WebBrowser控件渲染DOM后再注入jquery,用JS轻松采集内容,回调C# COM接口方法完成入库和驱动下一步.☆15Jul 11, 2016Updated 9 years ago
- 项目为重构marry-server项目。将开发架构改为了spring boot + mybatis开发模式。☆11Jul 22, 2023Updated 2 years ago
- Pholcus is a distributed high-concurrency crawler software written in pure golang☆7,609Nov 8, 2022Updated 3 years ago