CrawlScript / WebCollectorLinks
WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.
☆3,085Updated last month
Alternatives and similar repositories for WebCollector
Users that are interested in WebCollector are comparing it to the libraries listed below
Sorting:
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,518Updated 3 months ago
- A scalable web crawler framework for Java.☆11,645Updated last month
- A configurable web spider with a easy-to-use web console☆998Updated 7 years ago
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,994Updated 10 months ago
- JAVA WEB + ORM Framework☆3,265Updated last week
- Java分布式中文分词组件 - word分词☆1,822Updated 4 years ago
- Apache Nutch is an extensible and scalable web crawler☆3,077Updated this week
- Open Source Web Crawler for Java☆4,607Updated 3 years ago
- zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目☆917Updated 6 years ago
- 基于 webmagic 的 Java 爬虫应用☆2,784Updated 3 years ago
- Nutz -- Web Framework(Mvc/Ioc/Aop/Dao/Json) for ALL Java developer☆2,539Updated 2 months ago
- JavaEE项目开发脚手架(我的公众号:kaitao-1234567,我的新书:《亿级流量网站架构核心技术》)☆2,157Updated 7 years ago
- Jsoup学习笔记。添加了部分学习代码和注释。☆637Updated last year
- wechat4j is wechat(weixin) develop framework for java 微信开发框架JAVA版,最简单易用微信开发框架☆858Updated 8 years ago
- ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典☆6,531Updated last year
- 🗯 wechat-api by java7.☆1,808Updated 7 years ago
- QuestionAnsweringSystem是一个Java实现的人机问答系统,能够自动分析问题并给出候选答案。☆1,960Updated 7 years ago
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kyro/FST serialization, etc to the Dubbo service framework…☆4,866Updated 2 years ago
- A Spring Framework based, pragmatic style JavaEE application reference architecture.☆5,684Updated 2 years ago
- 微信公众号、企业号Java SDK☆2,751Updated 7 years ago
- JPress,一个使用 Java 开发的建站神器,目前已经有 10w+ 网站使用 JPress 进行驱动,其中包括多个政府机构,200+上市公司,中科院、红+字会等。☆2,719Updated 10 months ago
- The minimalist framework of RESTful(server and client) - Resty☆1,246Updated 3 years ago
- 结巴分词(java版)☆2,676Updated last year
- 极其方便的实现微信公众平台服务端开发,2行代码完成服务器绑定,3行代码实现用户消息监听☆776Updated last year
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆3,074Updated last week
- Netty learning.☆3,555Updated 8 years ago
- 这是一个针对ECharts2.x版本的Java类库,实现了所有ECharts中的Json结构对应的Java对象,并且可以很方便的创建Option,Series等☆1,106Updated 4 years ago
- Java utils☆841Updated 2 years ago
- Spring Boot Reference Guide中文翻译 -《Spring Boot参考指南》☆4,469Updated 8 years ago
- Jarslink is a sofa ark plugin used to manage multi-application deployment☆3,030Updated last year