CrawlScript / WebCollectorLinks
WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.
☆3,087Updated last month
Alternatives and similar repositories for WebCollector
Users that are interested in WebCollector are comparing it to the libraries listed below
Sorting:
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,519Updated 3 months ago
- A configurable web spider with a easy-to-use web console☆998Updated 7 years ago
- A scalable web crawler framework for Java.☆11,655Updated 2 months ago
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,992Updated 11 months ago
- Open Source Web Crawler for Java☆4,609Updated 4 years ago
- Apache Nutch is an extensible and scalable web crawler☆3,083Updated 3 weeks ago
- zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目☆917Updated 6 years ago
- JAVA WEB + ORM Framework☆3,268Updated this week
- Java分布式中文分词组件 - word分词☆1,822Updated 4 years ago
- 基于 webmagic 的 Java 爬虫应用☆2,786Updated 3 years ago
- A Spring Framework based, pragmatic style JavaEE application reference architecture.☆5,683Updated 3 years ago
- JavaEE项目开发脚手架(我的公众号:kaitao-1234567,我的新书:《亿级流量网站架构核心技术》)☆2,157Updated 7 years ago
- Jsoup学习笔记。添加了部分学习代码和注释。☆636Updated last year
- ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典☆6,534Updated last year
- Spring Boot Reference Guide中文翻译 -《Spring Boot参考指南》☆4,468Updated 8 years ago
- 微信公众号、企业号Java SDK☆2,750Updated 7 years ago
- Nutz -- Web Framework(Mvc/Ioc/Aop/Dao/Json) for ALL Java developer☆2,541Updated last week
- 🗯 wechat-api by java7.☆1,807Updated 7 years ago
- Distributed Scheduled Job Framework☆3,019Updated 3 years ago
- wechat4j is wechat(weixin) develop framework for java 微信开发框架JAVA版,最简单易用微信开发框架☆858Updated 8 years ago
- 结巴分词(java版)☆2,681Updated last year
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kyro/FST serialization, etc to the Dubbo service framework…☆4,866Updated 2 years ago
- Jarslink is a sofa ark plugin used to manage multi-application deployment☆3,031Updated last year
- Chinese translation of the Spring Framework 4.x Reference Documentation (https://docs.spring.io/spring/docs/4.3.13.RELEASE/spring-framewo…☆1,467Updated 7 years ago
- Anoyi's website☆1,098Updated 4 months ago
- A code generator for MyBatis.☆5,315Updated last week
- Netty learning.☆3,555Updated 8 years ago
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆3,081Updated last week
- Spring Boot集成MyBatis的基础项目☆3,366Updated 3 years ago
- The minimalist framework of RESTful(server and client) - Resty☆1,246Updated 4 years ago