crawlab-team/webspot

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/crawlab-team/webspot)

crawlab-team / webspot

An intelligent web service to automatically detect web content and extract information from it.

☆86

Alternatives and similar repositories for webspot

Users that are interested in webspot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Gerapy / GerapyAutoExtractor
View on GitHub
Auto Extractor Module
☆338Aug 19, 2024Updated last year
crawlab-team / crawlab-core
View on GitHub
Backend core modules for Crawlab
☆51Jun 14, 2024Updated 2 years ago
tieyongjie / scrapy-fingerprint
View on GitHub
☆70Nov 17, 2023Updated 2 years ago
GeneralNewsExtractor / GeneralNewsExtractor
View on GitHub
新闻网页正文通用抽取器 Beta 版.
☆3,788Apr 21, 2026Updated 3 months ago
GeneralNewsExtractor / GneList
View on GitHub
A chrome extension to get XPath of list items in webpage easily.
☆34Mar 11, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
crawlab-team / crawlab-docs
View on GitHub
Documentation for Crawlab
☆42Jun 9, 2026Updated last month
lulufubin / universal_icon_click_captcha
View on GitHub
图标点选验证码通用解决方案
☆28Jul 19, 2025Updated last year
crawlab-team / crawlab-frontend
View on GitHub
Frontend for Crawlab
☆19Jul 15, 2021Updated 5 years ago
Boris-code / boris-spider
View on GitHub
boris-spider是一款使用Python语言编写的爬虫框架，于多年的爬虫业务中不断磨合而诞生，相比于scrapy，该框架更易上手，且又满足复杂的需求，支持分布式及批次采集。
☆85Jan 21, 2022Updated 4 years ago
2833844911 / IPserver
View on GitHub
工具可以实现代理池的搭建利用手机可以一直切ip,把手机（使用流量，不是wifi）作为类似拨号服务器,可以在我们需要过ip风控(利用手机切ip)的时候使用
☆55May 18, 2024Updated 2 years ago
zxjlm / Poirot
View on GitHub
自动将字体文件映射为编码，主要用于中文字体反爬虫的破解
☆61May 19, 2024Updated 2 years ago
lixi5338619 / lxparse
View on GitHub
用于解析列表页链接和提取详细页内容的库
☆19Oct 26, 2023Updated 2 years ago
2833844911 / nodeV8
View on GitHub
node可以创建使用v8环境
☆22Apr 30, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yangshimin / SpiderCollection
View on GitHub
记录遇到的一些网站的加密和混淆
☆70May 5, 2022Updated 4 years ago
everywan / softwares
View on GitHub
收录的软件, 包括 arch的安装与配置, i3wm的配置, wsl的配置, osx的配置等
☆12Aug 25, 2025Updated 11 months ago
BigFaceCat2017 / risk-talk
View on GitHub
风控、实时计算、技术框架、架构方案、Groovy规则引擎、规则决策
☆10Jul 30, 2019Updated 6 years ago
MgArcher / Text_select_captcha
View on GitHub
实现文字点选、选字、选择、点触验证码识别，基于pytorch训练
☆1,638May 8, 2026Updated 2 months ago
rdcprojects / scrapy-mq-redis
View on GitHub
A RabbitMQ/Redis tool for Scrapy
☆13Oct 7, 2016Updated 9 years ago
TeamHG-Memex / autopager
View on GitHub
Detect and classify pagination links
☆107Apr 8, 2026Updated 3 months ago
DingZaiHub / PythonSpider
View on GitHub
JS逆向系列教程，模拟登录，AES、RSA、DES加密等，持续更新，欢迎 star！
☆434Apr 11, 2021Updated 5 years ago
mouday / spider-admin-pro-web
View on GitHub
☆34Nov 1, 2024Updated last year
sml2h3 / captcha_server
View on GitHub
一个免费开源一键搭建的通用验证码识别平台，大部分常见的中英数验证码识别都没啥问题。
☆192Jul 13, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
rangertaha / scrapy-prometheus-exporter
View on GitHub
Scrapy stats exporter for prometheus
☆23Jul 17, 2026Updated last week
perpetually2014 / Official_Accounts
View on GitHub
公众号
☆10Jul 24, 2023Updated 3 years ago
jiyulany / Fchrome
View on GitHub
这是一款对chromium源码进行定制的浏览器,支持爬虫/JS逆向工程师进行辅助分析网页
☆343Dec 25, 2024Updated last year
Boris-code / feapder
View on GitHub
🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单，功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬…
☆3,725Updated this week
kingname / AutoCrawler
View on GitHub
☆31Oct 17, 2024Updated last year
juneix / ink-nav
View on GitHub
拯救吃灰的Kindle，电子墨水屏专用导航
☆21Nov 21, 2023Updated 2 years ago
wanghaisheng / awesome-web-data-extractor
View on GitHub
A curated list of promising Web Data Extractors resources
☆31Dec 24, 2019Updated 6 years ago
lixi5338619 / weixin-spider
View on GitHub
《微信公众号采集系统》微信公众号文章的阅读数、在看数、评论数、评论列表，还有微信公众号的账号基本信息。
☆186Apr 29, 2022Updated 4 years ago
neverl805 / python_encrypt_tool
View on GitHub
Python代码加密保护.把对应文件内的所有py文件进行代码加密,得到一个新的代码，以防被看到并修改源码并且不影响正常使用
☆32Mar 5, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CapAllen / Mini_Data_Middle_Plateform
View on GitHub
基于Python+Flask+MySQL的数据微中台，支持数据库管理、数据收集（某乎爬虫等）等功能
☆10Sep 4, 2020Updated 5 years ago
pangxiaobin / SpiderCollection
View on GitHub
js逆向练习记录
☆13Nov 30, 2023Updated 2 years ago
cxapython / Spider
View on GitHub
爬虫，反爬虫， JS 逆向，安卓逆向， AST
☆12Sep 14, 2020Updated 5 years ago
lizongying / cron
View on GitHub
基于时间轮实现的定时任务，更准时，并发性能更高。支持crontab格式或every 1 second|minute|hour|day|month|week格式
☆16Nov 24, 2023Updated 2 years ago
Ryuchen / DeadPool
View on GitHub
该项目是一个使用celery作为主体框架的爬虫应用，能够灵活的添加爬虫任务，并且同时运行多站点的爬虫工作，所有组件都能够原生支持规模并发和分布式，加上celery原生的分布式调用，实现大规模并发。
☆40Sep 23, 2022Updated 3 years ago
crawlab-team / crawlab
View on GitHub
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架
☆12,251Feb 10, 2026Updated 5 months ago
yingchujun / CrackJs
View on GitHub
JS逆向：webpack、极验滑块、数字字母验证码、css加密、登陆流程、加速乐、补环境等案例
☆43Aug 15, 2024Updated last year