apache / tikaLinks
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
☆3,022Updated this week
Alternatives and similar repositories for tika
Users that are interested in tika are comparing it to the libraries listed below
Sorting:
- Apache Lucene and Solr open-source search software☆4,376Updated 8 months ago
- Mirror of Apache POI☆2,026Updated this week
- Mirror of Apache PDFBox☆2,838Updated this week
- Apache OpenNLP☆1,514Updated this week
- Apache Lucene open-source search software☆2,997Updated this week
- Apache ActiveMQ Classic☆2,359Updated last week
- Code for Quartz Scheduler☆6,516Updated last month
- Eclipse Jetty® - Web Container & Clients - supports HTTP/2, HTTP/1.1, HTTP/1.0, websocket, servlets, and more☆3,950Updated this week
- Apache Nutch is an extensible and scalable web crawler☆3,029Updated 2 months ago
- VisualVM is an All-in-One Java Troubleshooting Tool☆3,031Updated last week
- High performance non-blocking webserver☆3,646Updated 2 weeks ago
- Java binary serialization and cloning: fast, efficient, automatic☆6,331Updated this week
- Ehcache 3.x line☆2,050Updated 2 weeks ago
- OpenPDF is a free Java library for creating and editing PDF files, with a LGPL and MPL open source license. OpenPDF is based on a fork of…☆3,876Updated this week
- iText for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with …☆2,109Updated this week
- Java JNA wrapper for Tesseract OCR API☆1,672Updated 3 months ago
- The reliable, generic, fast and flexible logging framework for Java.☆3,109Updated 2 months ago
- Apache Solr open-source search software☆1,396Updated this week
- Drools is a rule engine, DMN engine and complex event processing (CEP) engine for Java.☆6,032Updated this week
- Apache Calcite☆4,856Updated this week
- JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the …☆5,703Updated this week
- Advanced Java Redis client for thread-safe sync, async, and reactive usage. Supports Cluster, Sentinel, Pipelining, and codecs.☆5,583Updated this week
- A high performance caching library for Java☆16,702Updated this week
- Official Elasticsearch Java Client☆478Updated this week
- Apache Freemarker☆1,030Updated 2 months ago
- Prometheus instrumentation library for JVM applications☆2,222Updated this week
- An application observability facade for the most popular observability tools. Think SLF4J, but for observability.☆4,617Updated this week
- MapDB provides concurrent Maps, Sets and Queues backed by disk storage or off-heap-memory. It is a fast and easy to use embedded Java dat…☆4,981Updated last year
- Postgresql JDBC Driver☆1,588Updated this week
- Apache ZooKeeper☆12,503Updated this week