Gerapy/GerapyAutoExtractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gerapy/GerapyAutoExtractor)

Gerapy / GerapyAutoExtractor

Auto Extractor Module

☆338

Alternatives and similar repositories for GerapyAutoExtractor

Users that are interested in GerapyAutoExtractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeneralNewsExtractor / GeneralNewsExtractor
View on GitHub
新闻网页正文通用抽取器 Beta 版.
☆3,788Apr 21, 2026Updated 2 months ago
Gerapy / GerapyPyppeteer
View on GitHub
Downloader Middleware to support Pyppeteer in Scrapy & Gerapy
☆132Dec 27, 2021Updated 4 years ago
crawlab-team / webspot
View on GitHub
An intelligent web service to automatically detect web content and extract information from it.
☆86Jul 13, 2023Updated 3 years ago
Gerapy / Gerapy
View on GitHub
Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
☆3,505Jul 4, 2026Updated last week
kingname / SifouSource
View on GitHub
Python 业务开发常见错误案例集配套源代码
☆10Dec 19, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
chencchen / RotateCaptchaBreak
View on GitHub
旋转验证码识别
☆219Jul 16, 2021Updated 5 years ago
lixi5338619 / lxparse
View on GitHub
用于解析列表页链接和提取详细页内容的库
☆19Oct 26, 2023Updated 2 years ago
yint-tech / sekiro-open
View on GitHub
SEKIRO is a multi-language, distributed, network topology-independent service publishing platform. By writing handlers in their respectiv…
☆1,910Jan 22, 2026Updated 5 months ago
Gerapy / GerapyProxy
View on GitHub
A package for supporting proxy in Scrapy & Gerapy
☆11Jul 15, 2020Updated 6 years ago
Python3WebSpider / ScrapyPyppeteer
View on GitHub
Scrapy Pyppeteer Demo
☆12Jul 30, 2020Updated 5 years ago
asyncins / antispider
View on GitHub
书籍《Python3 反爬虫原理与绕过实战》配套代码
☆627Oct 25, 2021Updated 4 years ago
Boris-code / feapder
View on GitHub
🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单，功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬…
☆3,720Jul 7, 2026Updated last week
BruceDone / clock
View on GitHub
可视化任务调度系统，精简到一个二进制文件 (Web visual task scheduler system , yes ! just one binary solve all the problems !)
☆194Mar 21, 2026Updated 3 months ago
chencchen / webcrawler
View on GitHub
逆向
☆402Oct 13, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Python3WebSpider / AdslProxy
View on GitHub
Adsl Proxy Pool
☆238Mar 31, 2023Updated 3 years ago
crawlab-team / crawlab
View on GitHub
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架
☆12,246Feb 10, 2026Updated 5 months ago
Gerapy / GerapySelenium
View on GitHub
Downloader Middleware to support Selenium in Scrapy & Gerapy
☆32Sep 13, 2020Updated 5 years ago
Python3WebSpider / Scrape
View on GitHub
Platform of Web Views to Scrape
☆11Jun 7, 2020Updated 6 years ago
bytebuff / JSpider
View on GitHub
JSpider会每周更新至少一个网站的JS解密方式，欢迎 Star，交流微信：13298307816
☆1,091Jun 22, 2022Updated 4 years ago
MgArcher / Text_select_captcha
View on GitHub
实现文字点选、选字、选择、点触验证码识别，基于pytorch训练
☆1,635May 8, 2026Updated 2 months ago
crawlaio / scrapy-redis-sentinel
View on GitHub
scrapy-redis-sentinel 基于 scrapy-redis 的基础上新增哨兵（sentinel）连接模式以及集群（cluster）连接模式。
☆30Mar 31, 2023Updated 3 years ago
MuggleK / CrawlersTools
View on GitHub
Tools for Crawlers
☆22Dec 25, 2023Updated 2 years ago
lixi5338619 / weixin-spider
View on GitHub
《微信公众号采集系统》微信公众号文章的阅读数、在看数、评论数、评论列表，还有微信公众号的账号基本信息。
☆185Apr 29, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
selfshore / spiders
View on GitHub
xhs(小红书)，易盾滑块，知乎登录
☆411Jun 8, 2025Updated last year
cilame / v_jstools
View on GitHub
模仿着写一个 chrome 插件，用来快速调试前端 js 代码。
☆3,005Apr 27, 2026Updated 2 months ago
bytebuff / ScrapingOutsourcing
View on GitHub
ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个
☆174May 20, 2020Updated 6 years ago
kerlomz / captcha_trainer
View on GitHub
[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This proje…
☆3,210Nov 9, 2025Updated 8 months ago
Python3WebSpider / ScrapyRedisBloomFilter
View on GitHub
Scrapy Redis Bloom Filter
☆175Jul 25, 2021Updated 4 years ago
JSREI / ast-hook-for-js-RE
View on GitHub
浏览器内存漫游解决方案（探索中...）
☆1,909May 7, 2024Updated 2 years ago
azwpayne / PythonScrape
View on GitHub
☆20Nov 29, 2020Updated 5 years ago
inlike / CookiePool
View on GitHub
一个强大的Cookie池项目，融合scrapy/requests/chrome储存cookie/cookie字符串/selenium等cookie形式
☆232Mar 13, 2020Updated 6 years ago
hfut-dmic / CEDP
View on GitHub
Online Web News Extraction via Tag Path Feature Weighted by Text Block Density
☆10Apr 1, 2017Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
XGSClear7 / fuck_dy
View on GitHub
抖音特别的几个参数解密
☆82Jun 29, 2021Updated 5 years ago
LearnKu-GX1 / 013_load_balancer
View on GitHub
从零创建一个负载均衡器
☆10Dec 12, 2021Updated 4 years ago
ylw00 / qxVm
View on GitHub
qxVm补环境框架(纯js实现)
☆490Nov 25, 2024Updated last year
tenlee2012 / scrapy-kafka-redis
View on GitHub
Distributed crawling/scraping, Kafka And Redis based components for Scrapy
☆46Nov 13, 2020Updated 5 years ago
clemfromspace / scrapy-puppeteer
View on GitHub
Scrapy + Puppeteer
☆110Jun 11, 2021Updated 5 years ago
my8100 / scrapydweb
View on GitHub
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.…
☆3,410Feb 19, 2025Updated last year
chenjinhu / JsKiller
View on GitHub
JsKiller 每月更新多个网站JS解密方式，欢迎Star
☆127Dec 20, 2019Updated 6 years ago