Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
☆35Apr 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for importer
Users that are interested in importer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to netw…☆24Sep 25, 2024Updated last year
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆201Updated this week
- Implementation of Norconex Committer for Elasticsearch.☆11Apr 27, 2026Updated 3 weeks ago
- FoGFaaS: Add serverless computing (faas) to ifogsim☆22Mar 30, 2025Updated last year
- Generic library shared between several projects.☆14Apr 25, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆36Dec 21, 2019Updated 6 years ago
- jQuery plugin to export the entire data from slick grid to excel. A client side javascript, jquery plugin to export slick grid to excel.☆10Sep 15, 2020Updated 5 years ago
- Overload method for $.ajax that provides the ability to try the request over if it fails the first time.☆35Jul 29, 2020Updated 5 years ago
- DistributeCrawler的Maven版☆10Jun 20, 2022Updated 3 years ago
- A jQuery plugin to make working with querystrings a breeze☆41Nov 2, 2014Updated 11 years ago
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆13Oct 2, 2016Updated 9 years ago
- kettle web manager support kettle 8.0☆17Mar 14, 2018Updated 8 years ago
- FFmpegKit implementation using Kotlin Multiplatform☆14Apr 4, 2023Updated 3 years ago
- ☆10Feb 26, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于E-Chart以及PyQt5制作简易可视化小工具,以便灵活观察各种时间框架下期货行情走势☆16Sep 22, 2025Updated 8 months ago
- Code samples for the Speedment ORM☆13Jun 21, 2022Updated 3 years ago
- A free multithreaded proxy checking program written in Java. Load a proxy list and check each proxy to verify it's alive to create a new …☆11Nov 5, 2015Updated 10 years ago
- DSL Platform - Java client☆12Dec 10, 2016Updated 9 years ago
- search topics of sina weibo by phantomjs☆12Dec 20, 2015Updated 10 years ago
- Lua-MapReduce framework implemented in Lua using luamongo driver and MongoDB as storage. It follows Iterative MapReduce for training of M…☆25Dec 23, 2015Updated 10 years ago
- 利用tushare pandas下载股票历史数据并存入mysql数据库☆13Dec 18, 2018Updated 7 years ago
- Effective IntelliJ IDEA☆40Jul 24, 2017Updated 8 years ago
- A llama.cpp binding for Kotlin multiplatform API for common use on (Android, iOS).☆32Jun 25, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆23Feb 6, 2026Updated 3 months ago
- Cassandra NoSQL + Bokeh + Prophet for stock time series analysis☆13Jul 13, 2018Updated 7 years ago
- Java client for EventStore (http://geteventstore.com)☆20May 25, 2015Updated 11 years ago
- 舆情项目处理层 分词 情感分析☆10Mar 22, 2016Updated 10 years ago
- 关于获取金融数据的notebook,练习了JAQS、Tushare、通联的API调用☆14Dec 3, 2018Updated 7 years ago
- 基于spring boot的 监控平台☆11Jun 17, 2015Updated 10 years ago
- Automatic CAPTCHA decoding☆11Apr 17, 2012Updated 14 years ago
- Easily use bulleted lists in your MJML emails.☆16Jul 29, 2023Updated 2 years ago
- Form designer for Activiti☆13Mar 4, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- Windows Live API binding and connect support.☆18Dec 1, 2024Updated last year
- 基于搜索引擎实现网盘搜索☆12Nov 15, 2018Updated 7 years ago
- 🪼 JellyFab – a delightful, spring-powered Floating Action Menu for Jetpack Compose. Elastic blob, soft shadows, smooth arcs, all in pure…☆43Nov 26, 2025Updated 5 months ago
- Tools to custom your domain resolved rules. Used BlackHole as DNS server.☆18Jun 22, 2013Updated 12 years ago
- An online MJML Editor with Liquid Support.☆27May 7, 2026Updated 2 weeks ago
- Implementing java based text extractors as web APIs (currently only Boilerpipe & Goose)☆16Apr 1, 2012Updated 14 years ago