Norconex / importerLinks
Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
☆34Updated last month
Alternatives and similar repositories for importer
Users that are interested in importer are comparing it to the libraries listed below
Sorting:
- Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV☆72Updated 2 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆190Updated 3 weeks ago
- Apache POI builder☆54Updated 2 years ago
- Java 11 Library with tons of utility classes required in all projects☆34Updated last week
- JODConverter automates document conversions using LibreOffice/OpenOffice.org☆35Updated 8 years ago
- FFMQ - full-java, light-weight, fast JMS 1.1 queuer implementation☆64Updated last year
- Implementation of Norconex Committer for Elasticsearch.☆11Updated 3 years ago
- edit a docx using CKEditor via XHTML round trip (with some session state)☆47Updated 7 years ago
- Grules - rule engine for data preprocessing☆37Updated 8 years ago
- jORM is a Lightweight Java ORM☆37Updated 6 years ago
- Clone of the Unitils SVN repository. Adds support for Java 8, HSqlDB and immutable collections☆23Updated 3 years ago
- BK-tree Java library☆28Updated 11 years ago
- A unique ID generator that specialises in small IDs.☆54Updated 3 weeks ago
- Provides simplified access to the ElasticSearch Java API.☆4Updated 4 years ago
- ☆27Updated 9 years ago
- Implementation of the new headless chrome with chromedriver and selenium.☆38Updated 6 years ago
- JDBC driver for CSV☆70Updated 7 years ago
- JFileHelpers is a library that automates the tedious task of parsing and creating structured text files. It handles fixed width or delimi…☆46Updated 7 years ago
- Advanced distributed task distribution library for Hazelcast. Customizable task load balancing with failover. For example: Fair task e…☆44Updated 11 years ago
- A Java Library that interfaces with GNU Gettext and Java i18n Facilities to Make i18n Easier☆31Updated 6 years ago
- Oddjob scheduler and task execution framework.☆20Updated 3 weeks ago
- Jawr Core Official Repository☆24Updated 8 years ago
- EventBus system for publish and subscribe to events within an application☆33Updated last year
- A modern JMX web console☆19Updated 2 years ago
- Neuro4j Workflow is a light-weight workflow engine for Java with Eclipse-based development environment. Workflow allows to build reusable…☆60Updated 6 years ago
- The SQL Processor is an engine producing the ANSI SQL statements and providing their execution without the necessity to write Java plumbi…☆28Updated 3 weeks ago
- Roostrap is a proven rapid application framework compilation built by putting together Spring Roo, Twitter Bootstrap and Google AppEngine…☆35Updated 10 years ago
- Powerful, hierachical based desktop search engine based on swing and lucene.☆18Updated 8 years ago
- A Fancy Chart Control in JavaFX☆27Updated 10 years ago
- JDBC high-performance data bulk unload. Convertion between ResultSet/CSV/SQL/sqlldr files☆45Updated 2 months ago