yasserg / crawler4jLinks
Open Source Web Crawler for Java
☆4,610Updated 4 years ago
Alternatives and similar repositories for crawler4j
Users that are interested in crawler4j are comparing it to the libraries listed below
Sorting:
- Apache Nutch is an extensible and scalable web crawler☆3,089Updated last week
- A scalable web crawler framework for Java.☆11,662Updated last week
- WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup …☆3,090Updated 2 months ago
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆3,089Updated 2 weeks ago
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,519Updated 4 months ago
- Asynchronous Http and WebSocket Client library for Java☆6,394Updated last week
- cglib - Byte Code Generation Library is high level API to generate and transform Java byte code. It is used by AOP, testing, data access …☆4,882Updated last year
- Jodd! Lightweight. Java. Zero dependencies. Use what you like.☆4,073Updated last year
- Thumbnailator - a thumbnail generation library for Java☆5,346Updated last month
- When jsoup meets XPath.☆472Updated 2 years ago
- Ehcache 3.x line☆2,073Updated 3 weeks ago
- A high performance caching library for Java☆17,224Updated this week
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat…☆5,028Updated last year
- jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.☆11,278Updated last week
- This is no longer the active Jersey repository. Please see the README.md☆2,850Updated 4 years ago
- A Java 8 string manipulation library.☆1,345Updated 5 years ago
- Java agent that enables class reloading in a running JVM☆2,720Updated 3 years ago
- Apache Commons Lang☆2,876Updated this week
- Unirest in Java: Simplified, lightweight HTTP client library.☆2,695Updated this week
- Main Portal page for the Jackson project☆9,566Updated last week
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,993Updated 11 months ago
- BTrace - a safe, dynamic tracing tool for the Java platform☆5,971Updated this week
- Feign makes writing java http clients easier☆9,761Updated this week
- Java binary serialization and cloning: fast, efficient, automatic☆6,441Updated this week
- Java serialization library, proto compiler, code generator☆2,087Updated 7 months ago
- Do not send pull requests! Automated Git clone of various OpenJDK branches☆2,149Updated 5 years ago
- Dozer is a Java Bean to Java Bean mapper that recursively copies data from one object to another.☆2,111Updated 4 months ago
- PowerMock is a Java framework that allows you to unit test code normally regarded as untestable.☆4,191Updated last year
- A configurable web spider with a easy-to-use web console☆998Updated 7 years ago
- MyBatis integration with Spring Boot☆4,226Updated last week