yasserg / crawler4j
Open Source Web Crawler for Java
☆4,583Updated 3 years ago
Alternatives and similar repositories for crawler4j:
Users that are interested in crawler4j are comparing it to the libraries listed below
- Apache Nutch is an extensible and scalable web crawler☆3,000Updated this week
- WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup …☆3,074Updated 2 months ago
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.☆2,945Updated last week
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,511Updated last year
- A scalable web crawler framework for Java.☆11,528Updated last month
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,986Updated 4 months ago
- When jsoup meets XPath.☆469Updated last year
- Jodd! Lightweight. Java. Zero dependencies. Use what you like.☆4,062Updated 11 months ago
- jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.☆11,096Updated this week
- Mirror of Apache HttpClient☆1,487Updated last week
- A simple blogging system implemented with Spring Boot + Hibernate + MySQL + Bootstrap4.☆1,641Updated 5 years ago
- Java agent that enables class reloading in a running JVM☆2,721Updated 2 years ago
- The simple, stupid rules engine for Java☆5,017Updated 10 months ago
- Elasticsearch Java Rest Client.☆2,115Updated 2 years ago
- Lightning fast and elegant mvc framework for Java8☆5,860Updated last month
- cglib - Byte Code Generation Library is high level API to generate and transform Java byte code. It is used by AOP, testing, data access …☆4,843Updated 7 months ago
- Do not send pull requests! Automated Git clone of various OpenJDK branches☆2,164Updated 4 years ago
- High performance non-blocking webserver☆3,630Updated 2 weeks ago
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat…☆4,961Updated 9 months ago
- Redis Java client☆12,004Updated this week
- Unirest in Java: Simplified, lightweight HTTP client library.☆2,654Updated 2 months ago
- Apache Commons Lang☆2,779Updated this week
- Asynchronous Http and WebSocket Client library for Java☆6,329Updated this week
- JAVA WEB + ORM Framework☆3,245Updated last week
- A configurable web spider with a easy-to-use web console☆994Updated 6 years ago
- Guice (pronounced 'juice') is a lightweight dependency injection framework for Java 11 and above, brought to you by Google.☆12,582Updated this week
- ☆3,193Updated last year
- Java binary serialization and cloning: fast, efficient, automatic☆6,280Updated last week
- This is no longer the active Jersey repository. Please see the README.md☆2,854Updated 3 years ago
- Apache ActiveMQ Classic☆2,348Updated 2 weeks ago