yasserg / crawler4j
Open Source Web Crawler for Java
☆4,591 · Updated 3 years ago
Alternatives and similar repositories for crawler4j
Users interested in crawler4j are comparing it to the libraries listed below.
- Apache Nutch is an extensible and scalable web crawler ☆3,013 · Updated last month
- WebCollector is an open source web crawler framework based on Java. It provides some simple interfaces for crawling the Web, you can setup … ☆3,072 · Updated 4 months ago
- A scalable web crawler framework for Java. ☆11,550 · Updated this week
- Asynchronous Http and WebSocket Client library for Java ☆6,346 · Updated this week
- jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety. ☆11,160 · Updated this week
- Lightning fast and elegant MVC framework for Java 8 ☆5,863 · Updated 3 months ago
- Easy to use lightweight web crawler ☆2,514 · Updated last year
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat… ☆4,975 · Updated 11 months ago
- Jodd! Lightweight. Java. Zero dependencies. Use what you like. ☆4,061 · Updated last year
- cglib - Byte Code Generation Library is high level API to generate and transform Java byte code. It is used by AOP, testing, data access … ☆4,852 · Updated 9 months ago
- Most popular Mocking framework for unit tests written in Java ☆15,140 · Updated 3 weeks ago
- PowerMock is a Java framework that allows you to unit test code normally regarded as untestable. ☆4,185 · Updated last year
- Apache Commons Lang ☆2,798 · Updated this week
- Admin UI for administration of spring boot applications ☆12,567 · Updated this week
- MyBatis integration with Spring Boot ☆4,192 · Updated 2 weeks ago
- The reliable, generic, fast and flexible logging framework for Java. ☆3,103 · Updated last month
- H2 is an embeddable RDBMS written in Java. ☆4,356 · Updated 3 weeks ago
- MyBatis SQL mapper framework for Java ☆20,065 · Updated last week
- Elasticsearch Java Rest Client. ☆2,116 · Updated 2 years ago
- Hibernate's core Object/Relational Mapping functionality ☆6,145 · Updated this week
- Apache Shiro ☆4,373 · Updated this week
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. ☆2,969 · Updated this week
- Thumbnailator - a thumbnail generation library for Java ☆5,266 · Updated 2 months ago
- Automated JSON API documentation for APIs built with Spring ☆5,933 · Updated last year
- High performance non-blocking webserver ☆3,647 · Updated last month
- Java dataframe and visualization library ☆3,634 · Updated last month
- Redis Java client ☆12,056 · Updated last week
- Ehcache 3.x line ☆2,048 · Updated 4 months ago
- Feign makes writing java http clients easier ☆9,660 · Updated this week
- A code generator for MyBatis. ☆5,300 · Updated last week