Germey / AwesomeWebScraping
List of libraries, tools and APIs for web scraping and data processing.
☆248Updated 9 months ago
Alternatives and similar repositories for AwesomeWebScraping:
Users that are interested in AwesomeWebScraping are comparing it to the libraries listed below
- 爬虫管理系统,支持集群,弹性伸缩。支持运行feapder、scrapy、selenium、playwright等各种框架及脚本☆108Updated last month
- 神奇的蜘蛛🕷,一个几乎适用于所有web端站点的采集方案☆334Updated 2 years ago
- An intelligent web service to automatically detect web content and extract information from it.☆85Updated last year
- SpiderBox - 虫盒 - 爬虫逆向资源导航站☆65Updated this week
- 这其实是一份学习笔记。包括学习记录、爬虫练习平台(网站)、自制工具脚本☆91Updated last year
- 使 scrapy 开发不用在意 item,pipeline,middleware 等通用场景下模块的编写,解放开发者的双手。☆84Updated this week
- 使用feapder爬虫框架开发的爬虫示例☆31Updated 2 years ago
- 记录一下js逆向的网站☆226Updated last year
- Auto Extractor Module☆325Updated 5 months ago
- js逆向和爬虫☆309Updated 2 years ago
- 碎片记录一些从开始学编程以来各种 零散的代码片,仅供个人方便查看使用。☆144Updated last year
- Python爬虫进阶 JS 解密逆向实战☆176Updated 2 years ago
- 这是一个xpath开发者的工具,可以帮助开发者快速的定位网页元素。☆222Updated last year
- Web crawler and data processing toolkit !☆50Updated 2 years ago
- 爬虫逆向破解心得记录,包含网易易盾、极验验证码、数美验证码、顶象验证码、同盾验证码、腾讯验证码、拼多多验证码,瑞数4 5 6代,包含验证码识别、滑块、点选、空间推理。已更新电商/工商/航空/抖音/小红书/文书采集系统。☆51Updated last year
- JS逆向破解☆100Updated 6 months ago
- Downloader Middleware to support Playwright in Scrapy & Gerapy☆108Updated 2 years ago
- Downloader Middleware to support Pyppeteer in Scrapy & Gerapy☆136Updated 3 years ago
- SpiderApi - 虫术 - 爬虫逆向常用 API☆68Updated this week
- awsome scrapy utils☆55Updated 9 months ago
- 我关注的一些优质公众号,基本都是js逆向和安卓逆向方面☆93Updated 9 months ago
- JS逆向系列教程,模拟登录,AES、RSA、DES加密等,持续更新,欢迎 star!☆409Updated 3 years ago
- Account Pool☆42Updated last year
- 一个免费开源一键搭建的通用验证码识别平台,大部分常见的中英数验证码识别都没啥问题。☆188Updated 3 years ago
- JS逆向Hook工具集,开源部分工具到这里☆326Updated last year
- 《微信公众号采集系统》微信公众号文章的阅读数、在看数、评论数、评论列表,还有微信公众号的账号基本信息。☆167Updated 2 years ago
- A chrome extension to get XPath of list items in webpage easily.☆35Updated 2 years ago
- Deep LearningImage Captcha 2☆172Updated 3 years ago