Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more.
☆142Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for node-tika
Users that are interested in node-tika are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extract text from a document by Apache Tika☆16Mar 16, 2026Updated last week
- A demo application of how to use handlebars (or another custom template engine) with Keystone.js☆13Feb 11, 2014Updated 12 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆18Jan 27, 2024Updated 2 years ago
- tool for authoring plugins for compromise☆13Apr 15, 2020Updated 5 years ago
- Export a JSON archive of a Gitter room's messages☆15Sep 26, 2018Updated 7 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- Multi-adapter bucket-based file system abstraction. #golang☆14Dec 3, 2025Updated 3 months ago
- A *very* simple ODM for MongoDB and NeDB on Node.js (using JS Harmony).☆23Feb 18, 2015Updated 11 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- File and stream concatenation the right way.☆25Oct 18, 2019Updated 6 years ago
- Simple redux history middleware.☆13Mar 7, 2016Updated 10 years ago
- Lightweight Go pattern for writing CLIs with subcommands☆12Feb 14, 2023Updated 3 years ago
- Generates a list of mock todos for TodoMVC apps ;)☆11May 4, 2016Updated 9 years ago
- General information and docs about Crosscloud☆18Oct 30, 2014Updated 11 years ago
- A Ruby parser for electronic candidate, PAC and party campaign filings from the Federal Election Commission.☆15Feb 3, 2024Updated 2 years ago
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 6 years ago
- This is the facade for installation and access to the individual components☆16Feb 10, 2026Updated last month
- pass pages through a pluggable pipeline to extract information from them.☆14Apr 21, 2015Updated 10 years ago
- Uduvudu☆17Mar 2, 2020Updated 6 years ago
- Google speech api wrapper for node☆85Feb 20, 2016Updated 10 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- "Open-source Dropbox" with added description features. It is a data storage and description platform designed to help researchers and oth…☆28Oct 12, 2022Updated 3 years ago
- A logging wrapper for winston.☆14Jul 14, 2017Updated 8 years ago
- Automatic compiler Jade templates to AngularJS modules for Brunch.IO☆31Aug 20, 2015Updated 10 years ago
- Virginia precinct-level maps as of June 2016☆12Jul 1, 2016Updated 9 years ago
- A tool for telling stories with maps.☆29Feb 26, 2026Updated 3 weeks ago
- Plugin for visionmedia/superagent that adds headers to all requests that prevents caching.☆26Feb 2, 2018Updated 8 years ago
- Reduce an array and return a Promise☆14Feb 22, 2018Updated 8 years ago
- Create a matching function from a glob pattern, regex, string, array or function.☆20Jun 18, 2017Updated 8 years ago
- Squebi provides an extendable SPARQL interface.☆22May 27, 2015Updated 10 years ago
- Data classification algorithms☆18Apr 6, 2015Updated 10 years ago
- React Native app☆14Feb 17, 2016Updated 10 years ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables.☆14Dec 8, 2022Updated 3 years ago
- Generates diff markup for two strings.☆20Mar 29, 2013Updated 12 years ago
- Solr Query Segmenter for structuring unstructured queries☆22May 12, 2021Updated 4 years ago
- compile jade templates to virtualdom☆22Sep 25, 2015Updated 10 years ago
- virtual host sub-domain mapping☆26May 14, 2025Updated 10 months ago