CrawlScript / WebCollector
WebCollector is an open-source web crawler framework based on Java. It provides simple interfaces for crawling the Web, and you can set up a multi-threaded web crawler in less than 5 minutes.
☆3,077Updated last month
Alternatives and similar repositories for WebCollector:
Users that are interested in WebCollector are comparing it to the libraries listed below
- An easy-to-use, lightweight web crawler☆2,510Updated last year
- A configurable web spider with an easy-to-use web console☆993Updated 6 years ago
- A scalable web crawler framework for Java.☆11,507Updated 2 weeks ago
- A simple, agile, distributed Java crawler framework with Spring Boot support.☆1,982Updated 3 months ago
- Open Source Web Crawler for Java☆4,576Updated 3 years ago
- zhihu-crawler is a high-performance, distributed, Java-based crawler project that supports a free HTTP proxy pool and horizontal scaling☆914Updated 5 years ago
- Apache Nutch is an extensible and scalable web crawler☆2,977Updated last month
- A distributed Chinese word segmentation component for Java - word segmentation☆1,819Updated 3 years ago
- JAVA WEB + ORM Framework☆3,244Updated last month
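- A Java crawler application based on webmagic☆2,781Updated 3 years ago
- Jieba Chinese word segmentation (Java version)☆2,609Updated 7 months ago
- ansj word segmentation: a true Java implementation of ict, with segmentation quality and speed both exceeding the open-source version of ict. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries☆6,509Updated last year
- Jsoup study notes, with some sample code and comments added.☆637Updated last year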
- Jarslink is a sofa ark plugin used to manage multi-application deployment☆3,046Updated 9 months ago
- A code generator for MyBatis.☆5,297Updated this week
- Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, with also keywo…☆921Updated last year
- WeChat SDK for Java (Official Accounts Platform, Open Platform, Merchant Platform, Service Provider Platform)☆2,499Updated 2 years ago
- Java EE project development scaffold (my WeChat official account: kaitao-1234567; my new book: 《亿级流量网站架构核心技术》)☆2,164Updated 6 years ago
- 🗯 wechat-api by java7.☆1,814Updated 6 years ago
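- A Java library for ECharts 2.x that implements Java objects for all of the JSON structures in ECharts and makes it easy to create Option, Series, and so on☆1,101Updated 3 years ago
- A minimalist RESTful framework (server and client) - Resty☆1,244Updated 3 years ago
- Nutz -- Web Framework (Mvc/Ioc/Aop/Dao/Json) for all Java developers☆2,537Updated 5 months ago
- Extremely convenient server-side development for the WeChat Official Accounts Platform: bind the server in 2 lines of code and listen for user messages in 3 lines☆774Updated 7 months ago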
- Netty learning.☆3,552Updated 8 years ago
- Dubbox now means Dubbo eXtensions, and it adds features like RESTful remoting, Kryo/FST serialization, etc. to the Dubbo service framework…☆4,887Updated 2 years ago
- Java SDK for WeChat Official Accounts and Enterprise Accounts☆2,748Updated 7 years ago
- No longer maintained. Please contact the original author.☆662Updated 6 years ago
- A base project integrating MyBatis with Spring Boot☆3,368Updated 3 years ago
- A lightweight Java web crawler framework.☆713Updated last month
- TProfiler is a performance profiling tool that can be used long-term in production environments☆2,385Updated 6 years ago