Norconex / importer
Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
☆33Updated last month
Related projects ⓘ
Alternatives and complementary repositories for importer
- Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV☆71Updated last year
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆183Updated this week
- Audit4j Spring Integration.☆18Updated 2 years ago
- Sample audit4j applications.☆18Updated 6 years ago
- Powerful, hierachical based desktop search engine based on swing and lucene.☆18Updated 7 years ago
- Support for DataNucleus persistence using the JPA API (JSR0220, JSR0317, JSR0338)☆13Updated 2 years ago
- An easy-to-implement library for the GeoHash algorithm☆66Updated 3 years ago
- Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to netw…☆22Updated last month
- The FSS(file storage service) APIs make storing the blob file easy and simple .☆41Updated last year
- Formio, form definition and binding library for Java platform☆26Updated 10 months ago
- Apache POI builder☆54Updated last year
- Static site generator from WizTools.org☆42Updated 6 years ago
- Plugin Framework for Wicket (PF4J - Wicket integration)☆33Updated 2 years ago
- The SQL Processor is an engine producing the ANSI SQL statements and providing their execution without the necessity to write Java plumbi…☆27Updated 7 months ago
- Provides simplified access to the ElasticSearch Java API.☆4Updated 3 years ago
- Fork of OpenCSV's svn repo☆74Updated last year
- Please use the luke bundled with lucene! This repo is archived and frozen now.☆101Updated 6 years ago
- JDBC high-performance data bulk unload. Convertion between ResultSet/CSV/SQL/sqlldr files☆44Updated 5 months ago
- OhmDB - The Irresistible RDBMS + NoSQL Database for Java☆62Updated 8 years ago
- Minimal event-driven framework for Java.☆15Updated 5 years ago
- A JHipster app reporting to Spark Streaming☆14Updated 9 years ago
- Sample project that creates graph on words in same tweets using Spring Boot + Spring Social Twitter + Spring Data Neo4J☆42Updated 8 years ago
- Java library for creating fluid page layouts with Apache PDFBox. Supporting multi-page tables, different page layouts etc.☆63Updated last week
- LightAdmin and JHipster integration example☆18Updated 11 months ago
- JSF template project to create PDF from html with css☆10Updated 9 years ago
- Some examples of using JDBI as a persistence framework☆42Updated 9 years ago
- For logging events which have long-term business significance.☆33Updated 3 years ago
- Neuro4j Workflow is a light-weight workflow engine for Java with Eclipse-based development environment. Workflow allows to build reusable…☆60Updated 5 years ago
- ☆40Updated 4 years ago