WebCollector is an open-source web crawler framework based on Java. It provides simple interfaces for crawling the web; you can set up a multi-threaded web crawler in less than 5 minutes.
☆3,092 · Feb 10, 2026 · Updated 4 months ago
Alternatives and similar repositories for WebCollector
Users that are interested in WebCollector are comparing it to the libraries listed below.
- A scalable web crawler framework for Java. ☆11,679 · Dec 20, 2025 · Updated 6 months ago
- Easy-to-use lightweight web crawler ☆2,514 · Jan 23, 2026 · Updated 5 months ago
- Open Source Web Crawler for Java ☆4,622 · Nov 4, 2021 · Updated 4 years ago
- A simple, agile, distributed Java crawler framework with Spring Boot support. ☆1,992 · Nov 25, 2024 · Updated last year
- Apache Nutch is an extensible and scalable web crawler ☆3,204 · Jun 14, 2026 · Updated last week
- An algorithm for automatically extracting the main content of web pages, implemented in Java ☆112 · Apr 18, 2017 · Updated 9 years ago
- A configurable web spider with an easy-to-use web console ☆997 · Jun 3, 2026 · Updated 2 weeks ago
- The Java implementation of Apache Dubbo. An RPC and microservice framework. ☆41,520 · Jun 15, 2026 · Updated last week
- A database connection pool built for monitoring, from the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team ☆28,174 · May 12, 2026 · Updated last month
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kryo/FST serialization, etc. to the Dubbo service framework… ☆4,836 · Mar 4, 2023 · Updated 3 years ago
- 👍 Java low-code, lightweight, Spring Boot, MyBatis, Flowable, TypeScript, Vue, Antdv, including core modules such as organizations, roles and users, permission authorization, data permissions, content management, workflow, Spring Cloud microservices, and more. ☆8,042 · Jun 16, 2026 · Updated last week
- Distributed scheduled job ☆8,203 · Updated this week
- A distributed task scheduling framework (XXL-JOB, a distributed task scheduling platform). ☆30,281 · Updated this week
- Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases. ☆20,734 · Updated this week
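- ☆17 · Mar 9, 2016 · Updated 10 years ago
- WeChat development Java SDK, supporting back-end development for WeChat Pay, Open Platform, Mini Programs, WeCom (Enterprise WeChat), Channels, Official Accounts, and more ☆32,878 · Jun 12, 2026 · Updated last week
- CAT, a foundational component for server-side projects, provides clients in multiple languages including Java, C/C++, Node.js, Python, and Go. It is deeply integrated into Meituan-Dianping's infrastructure middleware frameworks (MVC framework, RPC framework, database framework, cache framework, message queue, configuration system, etc.), providing Meituan-Dianping's business lines with rich system performance… ☆18,945 · Jan 4, 2025 · Updated last year
- A cross-language remote procedure call (RPC) framework for rapid development of high-performance distributed services. ☆5,877 · Nov 24, 2025 · Updated 6 months ago
- Alibaba Java Diagnostic Tool Arthas ☆37,377 · Jun 16, 2026 · Updated last week
- ☆9,530 · May 18, 2026 · Updated last month
- Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing ☆36,412 · Nov 15, 2025 · Updated 7 months ago
- A Java crawler application based on webmagic ☆2,776 · Jan 8, 2022 · Updated 4 years ago
- Apollo is a reliable configuration management system suitable for microservice configuration management scenarios. ☆29,760 · Updated this week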
- Distributed Configuration Management Platform ☆5,527 · Jul 18, 2023 · Updated 2 years ago
- Redisson: Valkey & Redis Java Client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey and Redis based Java obj… ☆24,360 · Updated this week
- Lightning fast and elegant MVC framework for Java 8 ☆5,878 · May 15, 2026 · Updated last month
- ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed the open-source version of ICT. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries ☆6,528 · Nov 19, 2023 · Updated 2 years ago
- A Spring Framework based, pragmatic style JavaEE application reference architecture. ☆5,651 · Oct 25, 2022 · Updated 3 years ago
- hsweb (haʊs wɛb) is a base project for enterprise back-office management systems, built on spring-boot 2.x and the first to use fully reactive programming. ☆8,404 · May 31, 2026 · Updated 3 weeks ago
- A comprehensive collection of Java resources (Chinese edition), covering libraries, development tools, websites, blogs, WeChat, Weibo, and more, continuously updated by 伯乐在线 (Jobbole). ☆15,693 · Jan 31, 2024 · Updated 2 years ago
- Alibaba Java Coding Guidelines PMD implementation and IDE plugin ☆30,827 · Aug 6, 2024 · Updated last year
- Google core libraries for Java ☆51,471 · Jun 16, 2026 · Updated last week
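- nutcher is Chinese-language documentation for Nutch, covering Nutch configuration and source code analysis; updated continuously. ☆130 · Jul 23, 2019 · Updated 6 years ago
- FASTJSON 2.0.x has been released; it is faster and more secure, and upgrading is recommended. ☆25,625 · Jul 16, 2024 · Updated last year
- JavaEE project development scaffold (my WeChat official account: kaitao-1234567; my new book: 《亿级流量网站架构核心技术》) ☆2,154 · Apr 5, 2018 · Updated 8 years ago
- A distributed agile development system architecture based on Spring + SpringMVC + Mybatis, providing a full set of common microservice modules: centralized permission management (single sign-on), content management, payment center, user management (with third-party login), WeChat platform, storage system, configuration center, log analysis, tasks and notifications, and more. It supports service governance, monitoring, and tracing, and strives to provide small and medium-sized enterprises… ☆16,673 · Dec 16, 2022 · Updated 3 years ago
- A powerful enhanced toolkit for MyBatis that simplifies development ☆17,394 · Jun 3, 2026 · Updated 2 weeks ago
- Alibaba's MySQL binlog incremental subscription and consumption component ☆29,700 · Jun 4, 2026 · Updated 2 weeks ago
- A powerful flow control component enabling reliability, resilience and monitoring for microservices (a high-availability flow-control and protection component for cloud-native microservices). ☆23,127 · May 27, 2026 · Updated 3 weeks ago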