A configurable web spider with an easy-to-use web console
☆997Aug 21, 2018Updated 7 years ago
Alternatives and similar repositories for spider
Users that are interested in spider are comparing it to the libraries listed below.
- A scalable web crawler framework for Java.☆11,696Dec 20, 2025Updated 3 months ago
- A simple, agile, distributed Java crawler framework with Spring Boot support.☆1,997Nov 25, 2024Updated last year
- WebCollector is an open source web crawler framework based on Java. It provides some simple interfaces for crawling the Web, you can set up …☆3,093Feb 10, 2026Updated last month
- An example that uses WebMagic to scrape job postings and persist them to MySQL.☆225Nov 22, 2016Updated 9 years ago
- A Java crawler application based on webmagic☆2,782Jan 8, 2022Updated 4 years ago
- Easy-to-use lightweight web crawler☆2,515Jan 23, 2026Updated 2 months ago
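- A hands-on Java crawler framework built on top of the webmagic framework. It can crawl news content from Tencent, Sohu, Toutiao (integrated as a separate feature), and others; combined with elasticsearch, it implements automated crawling and is already used in production.☆341Nov 16, 2022Updated 3 years ago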
- ☆644Feb 21, 2026Updated last month
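- hsweb (haʊs wɛb) is a base project for enterprise back-office management systems, built on spring-boot 2.x and the first to use fully reactive programming.☆8,405Mar 10, 2026Updated 2 weeks ago
- Distributed Configuration Management Platform☆5,534Jul 18, 2023Updated 2 years ago
- WK series development frameworks, V1 to V5: open-source enterprise Java development frameworks (single application/microservices/distributed)☆1,609Oct 31, 2023Updated 2 years ago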
- A headless, standalone webkit server which makes grabbing dynamic web pages easier.☆221Feb 15, 2019Updated 7 years ago
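- JPress, a website-building tool developed in Java. More than 100,000 websites are currently powered by JPress, including multiple government agencies, 200+ listed companies, the Chinese Academy of Sciences, the Red Cross, and others.☆2,726Nov 28, 2024Updated last year
- 👍Java low-code, lightweight, Spring Boot, MyBatis, Flowable, TypeScript, Vue, Antdv; includes core modules such as organizational structure, roles and users, permission authorization, data permissions, content management, workflow, Spring Cloud microservices, and more.☆8,044Mar 18, 2026Updated last week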
- MPush, an open-source real-time message push system☆3,775Jun 17, 2022Updated 3 years ago
- A distributed task scheduling framework (XXL-JOB).☆29,995Mar 22, 2026Updated last week
- A simple distributed spider in Java.☆160Jun 18, 2013Updated 12 years ago
- A lightweight Java web crawler framework.☆756Dec 20, 2025Updated 3 months ago
- Open Source Web Crawler for Java☆4,630Nov 4, 2021Updated 4 years ago
- When jsoup meets XPath.☆473Jan 27, 2026Updated 2 months ago
- Distributed Scheduled Job Framework☆3,009Oct 20, 2022Updated 3 years ago
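- A distributed agile development system architecture based on Spring+SpringMVC+Mybatis, providing a full set of common microservice modules: centralized permission management (single sign-on), content management, payment center, user management (with third-party login support), WeChat platform, storage system, configuration center, log analysis, tasks and notifications, and more. It supports service governance, monitoring, and tracing, and strives to give small and medium-sized enterprises…☆16,704Dec 16, 2022Updated 3 years ago
- Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing☆36,211Nov 15, 2025Updated 4 months ago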
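- WeChat development Java SDK, supporting back-end development for WeChat Pay, Open Platform, Mini Programs, WeCom, Channels, Official Accounts, and more☆32,667Mar 22, 2026Updated last week
- A database connection pool built for monitoring, from the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team☆28,217Mar 23, 2026Updated last week
- The Maven version of DistributeCrawler☆10Jun 20, 2022Updated 3 years ago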
- A cross-language remote procedure call (RPC) framework for rapid development of high performance distributed services.☆5,908Nov 24, 2025Updated 4 months ago
- Enterprise Stream Process Engine☆3,885Jun 16, 2023Updated 2 years ago
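- Packages a seimicrawler project so that it can be deployed quickly and standalone. It is based on a modified maven-war-plugin. This is made specifically for SeimiCrawl…☆14Jun 30, 2022Updated 3 years ago
- Distributed Job Schedule Platform☆559Jun 20, 2022Updated 3 years ago
- AutoLoadCache is an efficient cache management solution built on AOP, annotations, and related techniques. It decouples caching from business logic and adds asynchronous refresh and a "take-what-is-available" mechanism to suit high-concurrency environments.☆2,092Apr 1, 2024Updated last year
- zhihu-crawler is a high-performance, distributed, Java-based crawler project that supports a free HTTP proxy pool and horizontal scaling☆918Apr 2, 2019Updated 6 years ago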
- Apollo is a reliable configuration management system suitable for microservice configuration management scenarios.☆29,768Mar 21, 2026Updated last week
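- CAT is a foundational component for server-side projects that provides clients in multiple languages, including Java, C/C++, Node.js, Python, and Go. It is deeply integrated into Meituan-Dianping's infrastructure middleware (MVC framework, RPC framework, database framework, cache framework, message queue, configuration system, and so on), providing Meituan-Dianping's business lines with rich system performance…☆18,967Jan 4, 2025Updated last year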
- Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.☆20,701Updated this week
- Java dynamic code or JAR: publish your API or schedule on the fly☆148Jul 28, 2020Updated 5 years ago
- A proxy IP pool for crawlers☆568Sep 6, 2019Updated 6 years ago
- vip.com's Java coding standard, libraries and tools☆7,655Sep 6, 2023Updated 2 years ago
- Castle-Platform is a Java development platform that aims for high performance and high extensibility. It is spring-mvc, spring-data, spring-security, Querydsl, JPA, Redis, Mongodb, Neo4j, groovy-template…☆195Nov 16, 2022Updated 3 years ago