WolfgangFahl / pdfindexerLinks
Index and search PDF files using Apache Lucene and PDF Box
☆44Updated this week
Alternatives and similar repositories for pdfindexer
Users that are interested in pdfindexer are comparing it to the libraries listed below
Sorting:
- ☆38Updated 9 years ago
- An HTML to Asciidoc converter written in JavaScript☆23Updated 10 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago
- Core API for Silverpeas☆50Updated this week
- A tool to generate UML class diagrams from JSON schema documents☆40Updated 5 years ago
- resource scheduling and event planing☆63Updated last month
- A course on free/libre and open source software☆10Updated last year
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Fusion demo app searching open-source project data from the Apache Software Foundation☆43Updated 6 years ago
- Cuttlefish aims to be a highly extensible visualization and analysis platform for all kinds of network data☆18Updated 7 years ago
- A curated list of Awesome Apache Solr links and resources.☆109Updated 3 years ago
- Gephi Toolkit - All Gephi in a Library☆174Updated 10 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- Quick demos using the Toolkit☆94Updated 2 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆191Updated this week
- An internet tool for LibreOffice☆16Updated 5 years ago
- 📦 The Knowledge Box - A data dependency management framework to help users to publish, find and install data models☆46Updated this week
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 8 years ago
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Updated 9 years ago
- Spring integration with Stardog RDF database☆17Updated 5 months ago
- Cloudfier is a model-driven tool for rapid development of business applications☆22Updated 2 weeks ago
- Teiid Designer is a visual tool that enables rapid, model-driven definition, integration, management and testing of data services without…☆32Updated 2 years ago
- Provenance: Linking and Understanding Sources☆17Updated last year
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆130Updated 4 months ago
- openEHR and related reference models in computable form, including UML, XMI, BMM, Ecore, etc☆20Updated 5 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- sql engine for csv files☆16Updated 8 years ago
- JDBC driver for data.world☆18Updated 9 months ago
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago