WolfgangFahl / pdfindexerLinks
Index and search PDF files using Apache Lucene and PDF Box
☆44Updated 2 months ago
Alternatives and similar repositories for pdfindexer
Users that are interested in pdfindexer are comparing it to the libraries listed below
Sorting:
- ☆38Updated 9 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 10 years ago
- An HTML to Asciidoc converter written in JavaScript☆23Updated 10 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆46Updated 3 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Quick demos using the Toolkit☆96Updated 2 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- CiteSeerX public repository☆133Updated last year
- PDF Extraction Toolkit☆42Updated 4 years ago
- Cytoscape 3 desktop version.☆17Updated last month
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆31Updated 11 months ago
- A tool to generate UML class diagrams from JSON schema documents☆40Updated 5 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆133Updated 2 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆132Updated 6 months ago
- Fast in-memory graph structure, powering Gephi☆74Updated last week
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 5 months ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆86Updated 5 years ago
- A curated list of Awesome Apache Solr links and resources.☆109Updated 3 years ago
- Java port of TLSH (Trend Micro Locality Sensitive Hash)☆21Updated 4 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 3 years ago
- The GATE Embedded core API and GATE Developer application☆88Updated 10 months ago
- TextUML compiler and the TextUML Toolkit☆76Updated 3 weeks ago
- Gephi Toolkit - All Gephi in a Library☆178Updated last year
- Blazegraph Tinkerpop3 Implementation☆62Updated 4 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆11Updated last year
- Fusion demo app searching open-source project data from the Apache Software Foundation☆43Updated 6 years ago
- Apache UIMA Java SDK☆66Updated 7 months ago