Norconex Importer is a Java library and command-line application that parses a file of any format (HTML, PDF, Word, etc.) and extracts its content as plain text. It also lets you perform any manipulation on the extracted text before using it in your own service or application.
☆34Feb 21, 2026Updated last week
Alternatives and similar repositories for importer
Users that are interested in importer are comparing it to the libraries listed below
- FoGFaaS: Add serverless computing (faas) to ifogsim☆22Mar 30, 2025Updated 11 months ago
- Implementation of Norconex Committer for Elasticsearch.☆11Jan 4, 2022Updated 4 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆198Updated this week
- Generic library shared between several projects.☆14Feb 23, 2026Updated last week
- Super easy extraction of content from PDF-files☆12Jan 29, 2019Updated 7 years ago
- FeiTwnd's personal website☆31Updated this week
- Sources of the Xyna Factory Server, Xyna runtime applications (like GuiHttp or gitintegration), and installation scripts.☆18Updated this week
- MaxiCP☆17Updated this week
- X-definition 4.2 (Open Source Software)☆15Updated this week
- A Guava Table implementation based on two-dimensional data, with support for sorting and serialization/deserialization☆10Feb 15, 2017Updated 9 years ago
- This is a Polarion extension which provides common part to other extensions reducing code duplication.☆16Updated this week
- ☆18Updated this week
- Official Pangea Java Monorepo☆13Updated this week
- A collection of my notes and code covering algorithms from CLRS.☆14Nov 13, 2024Updated last year
- ☆13Aug 23, 2025Updated 6 months ago
- Simple implementation of RecyclerView.Adapter☆12Oct 30, 2017Updated 8 years ago
- Sharing a viewer we built for WNYC.☆12May 10, 2011Updated 14 years ago
- listen and fire events globally through a static callbacks class☆21Mar 9, 2016Updated 9 years ago
- Template for README.md☆66Feb 4, 2013Updated 13 years ago
- Roostrap is a proven rapid application framework compilation built by putting together Spring Roo, Twitter Bootstrap and Google AppEngine…☆35Dec 5, 2014Updated 11 years ago
- ☆12Updated this week
- User-friendly FTC control library☆16Feb 23, 2026Updated last week
- JBake Maven Plugin - NOTE: Code now resides in main JBake repository - https://github.com/jbake-org/jbake☆10Dec 28, 2021Updated 4 years ago
- Make XWiki an identity provider that can be reused by any application☆11Updated this week
- Assessment☆16Updated this week
- Lightweight persistence layer framework, support interface mapping, dynamic sql and sql file manager.☆12Feb 15, 2026Updated 2 weeks ago
- ☆10Feb 26, 2019Updated 7 years ago
- A rule-based approach to explain the output of any machine learning model☆15Apr 4, 2024Updated last year
- Frequently asked Linux questions for beginners☆26Apr 20, 2017Updated 8 years ago
- 🌀 Sejong University course registration: 올클 will take care of it for you!☆20Updated this week
- DSL Platform - Java client☆12Dec 10, 2016Updated 9 years ago
- Web page content extractor☆31Feb 26, 2013Updated 13 years ago
- Windows Live API binding and connect support.☆18Dec 1, 2024Updated last year
- prosEO – A Processing System for Earth Observation Data☆19Updated this week
- Maven version of DistributeCrawler☆10Jun 20, 2022Updated 3 years ago
- Lua-MapReduce framework implemented in Lua using luamongo driver and MongoDB as storage. It follows Iterative MapReduce for training of M…☆25Dec 23, 2015Updated 10 years ago
- Effective IntelliJ IDEA☆40Jul 24, 2017Updated 8 years ago
- Sample application using Spring and SQLFire☆16Apr 4, 2022Updated 3 years ago
- Detects the device's network connection status! Network connection☆13Dec 17, 2018Updated 7 years ago