Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
☆35Apr 27, 2026Updated last week
Alternatives and similar repositories for importer
Users that are interested in importer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FoGFaaS: Add serverless computing (faas) to ifogsim☆22Mar 30, 2025Updated last year
- Advanced fold methods for Kotlin☆12Updated this week
- ☆13Aug 23, 2025Updated 8 months ago
- DistributeCrawler的Maven版☆10Jun 20, 2022Updated 3 years ago
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆13Oct 2, 2016Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Demo package for dyson☆13Dec 13, 2022Updated 3 years ago
- Sketch adaptors for Pig.☆10Mar 28, 2026Updated last month
- Kotlin library for BigDecimal math functions (pow, sqrt, log, sin, ...) using arbitrary precision.☆20Jun 22, 2020Updated 5 years ago
- Library and applications for interfacing with eidc32 and intelli-m☆13Oct 18, 2021Updated 4 years ago
- FFmpegKit implementation using Kotlin Multiplatform☆14Apr 4, 2023Updated 3 years ago
- Code samples for the Speedment ORM☆13Jun 21, 2022Updated 3 years ago
- Extremely minimal Compose Multiplatform sample that demonstrates use of on-device AI on iOS and Android.☆47Mar 1, 2026Updated 2 months ago
- A free multithreaded proxy checking program written in Java. Load a proxy list and check each proxy to verify it's alive to create a new …☆11Nov 5, 2015Updated 10 years ago
- Lua-MapReduce framework implemented in Lua using luamongo driver and MongoDB as storage. It follows Iterative MapReduce for training of M…☆25Dec 23, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a sample project that include NFC value reading using insanely easy way.☆26Nov 1, 2016Updated 9 years ago
- Web page content extractor☆32Feb 26, 2013Updated 13 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Aug 5, 2016Updated 9 years ago
- 利用tushare pandas下载股票历史数据并存入mysql数据库☆13Dec 18, 2018Updated 7 years ago
- Effective IntelliJ IDEA☆40Jul 24, 2017Updated 8 years ago
- A llama.cpp binding for Kotlin multiplatform API for common use on (Android, iOS).☆32Jun 25, 2025Updated 10 months ago
- Cassandra NoSQL + Bokeh + Prophet for stock time series analysis☆13Jul 13, 2018Updated 7 years ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 10 years ago
- 舆情项目处理层 分词 情感分析☆10Mar 22, 2016Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于spring boot的 监控平台☆11Jun 17, 2015Updated 10 years ago
- Automatic CAPTCHA decoding☆11Apr 17, 2012Updated 14 years ago
- Form designer for Activiti☆13Mar 4, 2026Updated 2 months ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- Windows Live API binding and connect support.☆18Dec 1, 2024Updated last year
- 基于搜索引擎实现网盘搜索☆12Nov 15, 2018Updated 7 years ago
- 🪼 JellyFab – a delightful, spring-powered Floating Action Menu for Jetpack Compose. Elastic blob, soft shadows, smooth arcs, all in pure…☆42Nov 26, 2025Updated 5 months ago
- Implementing java based text extractors as web APIs (currently only Boilerpipe & Goose)☆16Apr 1, 2012Updated 14 years ago
- sync tushare data automatically☆15Mar 2, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- alias names for java types☆15Updated this week
- Roostrap is a proven rapid application framework compilation built by putting together Spring Roo, Twitter Bootstrap and Google AppEngine…☆35Dec 5, 2014Updated 11 years ago
- JBake Maven Plugin - NOTE: Code now resides in main JBake repository - https://github.com/jbake-org/jbake☆10Dec 28, 2021Updated 4 years ago
- java分布式爬虫,主机和从机控制的机制☆14May 21, 2015Updated 10 years ago
- A lightweight KSP annotation processor that generates reports to track technical debt in Kotlin projects☆51Feb 8, 2026Updated 2 months ago
- A standalone Java XML parser and serializer☆23Jul 11, 2025Updated 9 months ago
- a readability client for android☆25Jan 23, 2012Updated 14 years ago