yasserg / crawler4jLinks
Open Source Web Crawler for Java
☆4,628Updated 4 years ago
Alternatives and similar repositories for crawler4j
Users that are interested in crawler4j are comparing it to the libraries listed below
Sorting:
- Apache Nutch is an extensible and scalable web crawler☆3,121Updated this week
- WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup …☆3,091Updated 5 months ago
- A scalable web crawler framework for Java.☆11,700Updated last month
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,516Updated 2 weeks ago
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆3,182Updated this week
- Lightning fast and elegant mvc framework for Java8☆5,885Updated last month
- Jodd! Lightweight. Java. Zero dependencies. Use what you like.☆4,077Updated last year
- JAVA WEB + ORM Framework☆3,272Updated 3 months ago
- A Spring Framework based, pragmatic style JavaEE application reference architecture.☆5,671Updated 3 years ago
- Java agent that enables class reloading in a running JVM☆2,721Updated 3 years ago
- A configurable web spider with a easy-to-use web console☆998Updated 7 years ago
- jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.☆11,328Updated last week
- When jsoup meets XPath.☆473Updated 2 weeks ago
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,997Updated last year
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat…☆5,045Updated last year
- Redis Java client☆12,275Updated this week
- Demonstrates the features of the Spring MVC web framework☆4,982Updated 4 years ago
- Ehcache 3.x line☆2,077Updated 3 weeks ago
- Provide support to increase developer productivity in Java when using Elasticsearch. Uses familiar Spring concepts such as a template cla…☆2,958Updated this week
- Elasticsearch Java Rest Client.☆2,108Updated 2 years ago
- A Java 8 string manipulation library.☆1,341Updated 6 years ago
- Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizati…☆1,320Updated 7 years ago
- Spring Data Example Projects☆5,407Updated last month
- Nutz -- Web Framework(Mvc/Ioc/Aop/Dao/Json) for ALL Java developer☆2,547Updated 3 months ago
- zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目☆917Updated 6 years ago
- A simple blogging system implemented with Spring Boot + Hibernate + MySQL + Bootstrap4.☆1,646Updated 5 years ago
- Benchmark comparing serialization libraries on the JVM☆3,295Updated 2 years ago
- ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典☆6,540Updated 2 years ago
- Jsoup学习笔记。添加了部分学习代码和注释。☆636Updated 2 years ago
- Java serialization library, proto compiler, code generator☆2,094Updated 10 months ago