WebCollector is an open source web crawler framework based on Java. It provides simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes.
☆3,093 · Feb 10, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for WebCollector
Users who are interested in WebCollector compare it to the libraries listed below.
- A scalable web crawler framework for Java. ☆11,703 · Dec 20, 2025 · Updated 2 months ago
- An easy-to-use, lightweight web crawler ☆2,514 · Jan 23, 2026 · Updated last month
- Open Source Web Crawler for Java ☆4,627 · Nov 4, 2021 · Updated 4 years ago
- A simple, agile, distributed Java crawler framework with Spring Boot support. ☆1,994 · Nov 25, 2024 · Updated last year
- Apache Nutch is an extensible and scalable web crawler ☆3,135 · Updated this week
- An algorithm for automatically extracting the main text of web pages, implemented in Java ☆111 · Apr 18, 2017 · Updated 8 years ago
- A configurable web spider with an easy-to-use web console ☆998 · Aug 21, 2018 · Updated 7 years ago
- The Java implementation of Apache Dubbo. An RPC and microservice framework. ☆41,738 · Feb 20, 2026 · Updated last week
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kryo/FST serialization, etc. to the Dubbo service framework… ☆4,854 · Mar 4, 2023 · Updated 2 years ago
- Produced by the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team: a database connection pool built for monitoring ☆28,218 · Updated this week
- Distributed scheduled job ☆8,221 · Feb 7, 2026 · Updated 3 weeks ago
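- 👍 Java low-code, lightweight, Spring Boot, MyBatis, Flowable, TypeScript, Vue, Antdv; includes core modules such as organizational structure, roles and users, permission authorization, data permissions, content management, workflow, and Spring Cloud microservices. ☆8,048 · Updated this week
- A distributed task scheduling framework (XXL-JOB). ☆29,921 · Feb 11, 2026 · Updated 2 weeks ago
- Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases. ☆20,685 · Updated this week
- WeChat development Java SDK, supporting back-end development for WeChat Pay, Open Platform, Mini Programs, WeCom (Enterprise WeChat), Channels, Official Accounts, and more ☆32,577 · Updated this week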
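- A cross-language remote procedure call (RPC) framework for rapid development of high performance distributed services. ☆5,911 · Nov 24, 2025 · Updated 3 months ago
- CAT is a foundational component for server-side projects. It provides clients in multiple languages, including Java, C/C++, Node.js, Python, and Go, and is deeply integrated into Meituan-Dianping's infrastructure middleware frameworks (MVC framework, RPC framework, database framework, cache framework, message queue, configuration system, etc.), providing Meituan-Dianping's business lines with rich system performance… ☆18,976 · Jan 4, 2025 · Updated last year
- Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing ☆36,146 · Nov 15, 2025 · Updated 3 months ago
- Alibaba Java Diagnostic Tool Arthas ☆37,098 · Updated this week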
- ☆9,553 · Jan 15, 2024 · Updated 2 years ago
- Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey and Redis based Java objec… ☆24,255 · Feb 20, 2026 · Updated last week
- Distributed Configuration Management Platform ☆5,539 · Jul 18, 2023 · Updated 2 years ago
- Apollo is a reliable configuration management system suitable for microservice configuration management scenarios. ☆29,781 · Updated this week
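- A Java crawler application based on webmagic ☆2,782 · Jan 8, 2022 · Updated 4 years ago
- Lightning fast and elegant MVC framework for Java 8 ☆5,885 · Dec 15, 2025 · Updated 2 months ago
- hsweb (haʊs wɛb) is a base project for enterprise back-office management systems, built on spring-boot 2.x and the first to use fully reactive programming. ☆8,409 · Feb 15, 2026 · Updated last week
- Chinese edition of a comprehensive list of Java resources, covering libraries, development tools, websites, blogs, WeChat accounts, Weibo accounts, and more, continuously updated by Jobbole (伯乐在线). ☆15,723 · Jan 31, 2024 · Updated 2 years ago
- A distributed agile development system architecture based on Spring + SpringMVC + Mybatis, providing a full set of common microservice modules: centralized permission management (single sign-on), content management, payment center, user management (with third-party login support), WeChat platform, storage system, configuration center, log analysis, tasks and notifications, and more; supports service governance, monitoring, and tracing, striving for small and medium-sized enterprises… ☆16,712 · Dec 16, 2022 · Updated 3 years ago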
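- Alibaba Java Coding Guidelines PMD implementation and IDE plugin ☆30,827 · Aug 6, 2024 · Updated last year
- ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries ☆6,545 · Nov 19, 2023 · Updated 2 years ago
- A Spring Framework based, pragmatic style JavaEE application reference architecture. ☆5,668 · Oct 25, 2022 · Updated 3 years ago
- A powerful enhanced toolkit for MyBatis that simplifies development ☆17,310 · Feb 7, 2026 · Updated 3 weeks ago
- A powerful flow control component enabling reliability, resilience and monitoring for microservices (a high-availability flow control and protection component for cloud-native microservices). ☆23,070 · Jan 26, 2026 · Updated last month
- Alibaba's MySQL binlog incremental subscription and consumption component ☆29,612 · Feb 12, 2026 · Updated 2 weeks ago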
- Google core libraries for Java ☆51,479 · Updated this week
- JavaEE project development scaffold (my WeChat official account: kaitao-1234567; my new book: 《亿级流量网站架构核心技术》) ☆2,153 · Apr 5, 2018 · Updated 7 years ago
- MPush, an open-source real-time message push system ☆3,777 · Jun 17, 2022 · Updated 3 years ago
- FASTJSON 2.0.x has been released, faster and more secure, recommend you upgrade. ☆25,717 · Jul 16, 2024 · Updated last year
- vip.com's Java coding standards, libraries, and tools ☆7,659 · Sep 6, 2023 · Updated 2 years ago