Norconex Importer is a Java library and command-line application that parses a file, whatever its format (HTML, PDF, Word, etc.), and extracts its content as plain text. It also lets you manipulate the extracted text before using it in your own service or application.
☆34Feb 21, 2026Updated last month
Alternatives and similar repositories for importer
Users who are interested in importer are comparing it to the libraries listed below
- Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to netw…☆24Sep 25, 2024Updated last year
- Implementation of Norconex Committer for Elasticsearch.☆11Jan 4, 2022Updated 4 years ago
- FoGFaaS: Add serverless computing (FaaS) to iFogSim☆22Mar 30, 2025Updated 11 months ago
- Generic library shared between several projects.☆14Feb 23, 2026Updated 3 weeks ago
- A Usenet binary grabber.☆12Nov 6, 2015Updated 10 years ago
- jQuery plugin to export the entire data from slick grid to excel. A client side javascript, jquery plugin to export slick grid to excel.☆10Sep 15, 2020Updated 5 years ago
- A jQuery plugin for keeping the aspect ratio☆38Jun 6, 2015Updated 10 years ago
- Maven version of DistributeCrawler☆10Jun 20, 2022Updated 3 years ago
- Smart retry for jQuery's ajax.☆29Jun 21, 2016Updated 9 years ago
- A jQuery plugin to make working with querystrings a breeze☆41Nov 2, 2014Updated 11 years ago
- Rust natural language processing model with a focus on mapping back to source and "layerable" recognizers☆19Jan 3, 2022Updated 4 years ago
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆13Oct 2, 2016Updated 9 years ago
- Demo package for dyson☆13Dec 13, 2022Updated 3 years ago
- Kotlin library for BigDecimal math functions (pow, sqrt, log, sin, ...) using arbitrary precision.☆21Jun 22, 2020Updated 5 years ago
- Library and applications for interfacing with eidc32 and intelli-m☆13Oct 18, 2021Updated 4 years ago
- kettle web manager support kettle 8.0☆17Mar 14, 2018Updated 8 years ago
- FFmpegKit implementation using Kotlin Multiplatform☆14Apr 4, 2023Updated 2 years ago
- A simple visualization tool built with E-Chart and PyQt5 for flexibly viewing futures market trends across various time frames☆15Sep 22, 2025Updated 6 months ago
- Extremely minimal Compose Multiplatform sample that demonstrates use of on-device AI on iOS and Android.☆46Mar 1, 2026Updated 3 weeks ago
- A free multithreaded proxy checking program written in Java. Load a proxy list and check each proxy to verify it's alive to create a new …☆11Nov 5, 2015Updated 10 years ago
- DSL Platform - Java client☆12Dec 10, 2016Updated 9 years ago
- search topics of sina weibo by phantomjs☆12Dec 20, 2015Updated 10 years ago
- 🪼 JellyFab – a delightful, spring-powered Floating Action Menu for Jetpack Compose. Elastic blob, soft shadows, smooth arcs, all in pure…☆40Nov 26, 2025Updated 3 months ago
- Demo environment for the base business framework☆18Dec 27, 2018Updated 7 years ago
- Sketch plugin that hyphenates text☆12Apr 5, 2021Updated 4 years ago
- Web page content extractor☆31Feb 26, 2013Updated 13 years ago
- Downloads historical stock data using tushare and pandas and stores it in a MySQL database☆13Dec 18, 2018Updated 7 years ago
- A llama.cpp binding for Kotlin multiplatform API for common use on (Android, iOS).☆30Jun 25, 2025Updated 8 months ago
- A new framework to generate interpretable classification rules☆18Feb 11, 2023Updated 3 years ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 10 years ago
- Device resolution detection module☆39Sep 12, 2016Updated 9 years ago
- Processing layer of a public-opinion analysis project: word segmentation and sentiment analysis☆10Mar 22, 2016Updated 10 years ago
- Kairos combines a focused crawler and an information extraction engine to convert a list of conference websites into an index filled wit…☆18Feb 20, 2011Updated 15 years ago
- Automatic CAPTCHA decoding☆11Apr 17, 2012Updated 13 years ago
- Easily use bulleted lists in your MJML emails.☆16Jul 29, 2023Updated 2 years ago
- Form designer for Activiti☆13Mar 4, 2026Updated 2 weeks ago
- Notebooks based on financial machine learning.☆16Nov 21, 2022Updated 3 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- Cloud-drive file search built on search engines☆12Nov 15, 2018Updated 7 years ago