WebCollector is an open source web crawler framework based on Java. It provides simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes.
☆3,094Feb 10, 2026Updated last month
Alternatives and similar repositories for WebCollector
Users that are interested in WebCollector are comparing it to the libraries listed below.
- A scalable web crawler framework for Java.☆11,698Dec 20, 2025Updated 3 months ago
- An easy-to-use, lightweight web crawler☆2,514Jan 23, 2026Updated 2 months ago
- Open Source Web Crawler for Java☆4,627Nov 4, 2021Updated 4 years ago
- A simple, agile, distributed Java crawler framework with Spring Boot support.☆1,997Nov 25, 2024Updated last year
- Apache Nutch is an extensible and scalable web crawler☆3,145Feb 27, 2026Updated last month
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆3,210Mar 10, 2026Updated 3 weeks ago
- An algorithm for automatically extracting the main content of web pages, implemented in Java☆112Apr 18, 2017Updated 8 years ago
- A configurable web spider with an easy-to-use web console☆997Aug 21, 2018Updated 7 years ago
- The Java implementation of Apache Dubbo. An RPC and microservice framework.☆41,683Updated this week
- A database connection pool built for monitoring, from the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team☆28,210Mar 23, 2026Updated last week
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kryo/FST serialization, etc. to the Dubbo service framework…☆4,844Mar 4, 2023Updated 3 years ago
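- 👍Java low-code, lightweight, Spring Boot, MyBatis, Flowable, TypeScript, Vue, Antdv; includes core modules such as organizational structure, roles and users, permission authorization, data permissions, content management, workflow, and Spring Cloud microservices.☆8,043Mar 26, 2026Updated last week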
- Distributed scheduled job☆8,222Mar 10, 2026Updated 3 weeks ago
- A distributed task scheduling framework (XXL-JOB).☆29,995Mar 22, 2026Updated last week
- Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.☆20,702Updated this week
- ☆17Mar 9, 2016Updated 10 years ago
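- WeChat development Java SDK, supporting backend development for WeChat Pay, Open Platform, Mini Programs, WeCom (Enterprise WeChat), Channels, Official Accounts, and more☆32,679Mar 26, 2026Updated last week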
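- CAT is a foundational component for server-side projects that provides clients in Java, C/C++, Node.js, Python, Go, and other languages. It is deeply integrated into Meituan-Dianping's infrastructure middleware (MVC framework, RPC framework, database framework, cache framework, message queue, configuration system, etc.) and provides Meituan-Dianping's business lines with rich system performance…☆18,967Jan 4, 2025Updated last year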
- A cross-language remote procedure call(RPC) framework for rapid development of high performance distributed services.☆5,904Nov 24, 2025Updated 4 months ago
- Alibaba Java Diagnostic Tool Arthas☆37,216Updated this week
- ☆9,542Jan 15, 2024Updated 2 years ago
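- Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing☆36,226Nov 15, 2025Updated 4 months ago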
- A Java crawler application based on webmagic☆2,782Jan 8, 2022Updated 4 years ago
- Apollo is a reliable configuration management system suitable for microservice configuration management scenarios.☆29,750Mar 28, 2026Updated last week
- Distributed Configuration Management Platform☆5,532Jul 18, 2023Updated 2 years ago
- Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey and Redis based Java objec…☆24,284Mar 27, 2026Updated last week
- Lightning-fast and elegant MVC framework for Java 8☆5,882Dec 15, 2025Updated 3 months ago
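- ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries☆6,541Nov 19, 2023Updated 2 years ago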
- A Spring Framework based, pragmatic style JavaEE application reference architecture.☆5,666Oct 25, 2022Updated 3 years ago
- hsweb (haʊs wɛb) is a foundational project for enterprise back-office management systems, built on spring-boot 2.x and the first to use fully reactive programming.☆8,401Mar 26, 2026Updated last week
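- The Chinese edition of the awesome Java resources list, covering development libraries, development tools, websites, blogs, WeChat, Weibo, and more, continuously updated by Jobbole (伯乐在线).☆15,711Jan 31, 2024Updated 2 years ago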
- Alibaba Java Coding Guidelines pmd implements and IDE plugin☆30,824Aug 6, 2024Updated last year
- Google core libraries for Java☆51,515Mar 23, 2026Updated last week
- nutcher is Chinese-language documentation for Nutch, covering Nutch configuration and source code analysis; continuously updated.☆130Jul 23, 2019Updated 6 years ago
- FASTJSON 2.0.x has been released; it is faster and more secure, and upgrading is recommended.☆25,696Jul 16, 2024Updated last year
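- JavaEE project development scaffold (my WeChat official account: kaitao-1234567; my new book: 《亿级流量网站架构核心技术》, "Core Technologies of Website Architecture for Hundreds of Millions of Visits")☆2,153Apr 5, 2018Updated 7 years ago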
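- A distributed agile development system architecture based on Spring + SpringMVC + Mybatis, providing a full set of common microservice modules: centralized permission management (single sign-on), content management, payment center, user management (with third-party login support), WeChat platform, storage system, configuration center, log analysis, tasks and notifications, and more; supports service governance, monitoring, and tracing, striving for small and medium-sized enterprises…☆16,697Dec 16, 2022Updated 3 years ago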
- A powerful enhanced toolkit for MyBatis that simplifies development☆17,332Feb 7, 2026Updated last month
- Alibaba's MySQL binlog incremental subscription and consumption component☆29,643Feb 12, 2026Updated last month