Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more.
☆142Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for node-tika
Users that are interested in node-tika are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extract text from a document by Apache Tika☆16Mar 20, 2026Updated 3 weeks ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Apr 9, 2025Updated last year
- DEPRECATED: RateIt jQuery (star) rating plugin. Meteor can use it directly from npm now.☆35Feb 26, 2015Updated 11 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- AngularJS, Play and Scala web interface to Wedding Tables Planner☆21Aug 17, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A little artoo.js bookmarklet to scrape and download the wanted or missing person lists from Interpol.☆12Dec 3, 2014Updated 11 years ago
- Export a JSON archive of a Gitter room's messages☆15Sep 26, 2018Updated 7 years ago
- Multi-adapter bucket-based file system abstraction. #golang☆14Apr 8, 2026Updated last week
- A *very* simple ODM for MongoDB and NeDB on Node.js (using JS Harmony).☆23Feb 18, 2015Updated 11 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- Archive and make discoverable data and links with schema.org metadata.☆38Nov 4, 2014Updated 11 years ago
- ☆11Aug 8, 2016Updated 9 years ago
- Generates a list of mock todos for TodoMVC apps ;)☆11May 4, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 6 years ago
- pass pages through a pluggable pipeline to extract information from them.☆14Apr 21, 2015Updated 10 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆57Nov 19, 2012Updated 13 years ago
- Uduvudu☆17Mar 2, 2020Updated 6 years ago
- CLI for parsing FEC files, for federal campaign finance pipelines☆24Mar 9, 2026Updated last month
- "Open-source Dropbox" with added description features. It is a data storage and description platform designed to help researchers and oth…☆28Oct 12, 2022Updated 3 years ago
- tool for analyzing and converting PDF☆105Jul 27, 2016Updated 9 years ago
- A logging wrapper for winston.☆14Jul 14, 2017Updated 8 years ago
- A example site using HAPI, JWT tokens and swagger documentation☆28Oct 3, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- pass-stream - pass-through node.js stream which can filter/adapt and pause data☆17Mar 3, 2017Updated 9 years ago
- Plugin for visionmedia/superagent that adds headers to all requests that prevents caching.☆26Feb 2, 2018Updated 8 years ago
- Reduce an array and return a Promise☆14Feb 22, 2018Updated 8 years ago
- Create a matching function from a glob pattern, regex, string, array or function.☆20Jun 18, 2017Updated 8 years ago
- Data classification algorithms☆18Apr 6, 2015Updated 11 years ago
- produce a stream of citiation data coming off wikimedia☆12Mar 28, 2017Updated 9 years ago
- a keep-alive agent for node http & https with a really snappy name☆32May 2, 2017Updated 8 years ago
- Animates elements inside a {{> AnimateWithVelocity}} block, by adding specific attributes to elements.☆22Mar 28, 2016Updated 10 years ago
- ☆10Nov 13, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- React Native app☆14Feb 17, 2016Updated 10 years ago
- The DDI Discovery Vocabulary, an RDF vocabulary for data description and discovery based on DDI☆25May 5, 2023Updated 2 years ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables.☆14Mar 23, 2026Updated 3 weeks ago
- Given a new image, determine if it is likely derived from a known image.☆21Updated this week
- Facilitates the indexing of content from a CSV into ElasticSearch☆27Oct 3, 2013Updated 12 years ago
- Node.js module for interfacing with the TreeTagger toolkit by Helmut Schmid.☆15Apr 12, 2015Updated 11 years ago
- compile jade templates to virtualdom☆22Sep 25, 2015Updated 10 years ago