node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,693Dec 15, 2025Updated 6 months ago
Alternatives and similar repositories for textract
Users that are interested in textract are comparing it to the libraries listed below.
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excel (xlsx) in java…☆2,714Apr 30, 2024Updated 2 years ago
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.☆2,204Apr 16, 2026Updated 2 months ago
- Parse office documents (doc, docx, xls, etc..)☆183Apr 14, 2014Updated 12 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,161May 26, 2023Updated 3 years ago
- general natural language facilities for node☆10,876Feb 22, 2026Updated 4 months ago
- The next web scraper. See through the <html> noise.☆5,904May 6, 2026Updated last month
- Convert Word documents (.docx files) to HTML☆6,241May 24, 2026Updated last month
- An image processing library written entirely in JavaScript for Node, with zero external or native dependencies.☆14,626Apr 7, 2026Updated 2 months ago
- Pure Javascript OCR for more than 100 Languages 📖🎉🖥☆38,171May 17, 2026Updated last month
- Distributed, realtime CLI for live Node apps.☆3,419Aug 27, 2021Updated 4 years ago
- modest natural-language processing☆12,126Jun 23, 2026Updated last week
- Premium Queue package for handling distributed jobs and messages in NodeJS.☆16,242Jun 24, 2026Updated last week
- Node's framework for interactive CLIs☆5,628Sep 19, 2023Updated 2 years ago
- Debug Node.js code with Chrome Developer Tools.☆2,319Nov 10, 2022Updated 3 years ago
- Package your Node.js project into an executable☆24,369Jan 3, 2024Updated 2 years ago
- Kue is a priority job queue backed by redis, built for node.js.☆9,437Feb 12, 2024Updated 2 years ago
- Asynchronous HTTP microservices☆10,620May 21, 2026Updated last month
- The fastest way to build beautiful Electron apps using simple HTML and CSS☆10,083Apr 3, 2026Updated 3 months ago
- Command Line UI toolkit for Node.js☆1,660Sep 8, 2020Updated 5 years ago
- 📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs☆36,279Apr 18, 2024Updated 2 years ago
- Node.js Desktop Automation.☆12,749Jun 16, 2026Updated 2 weeks ago
- React UI Components for macOS High Sierra and Windows 10☆9,494Jul 1, 2023Updated 3 years ago
- The world's most versatile desktop notifications framework☆8,663Dec 15, 2023Updated 2 years ago
- Official Elasticsearch client library for Node.js☆5,295Jun 26, 2026Updated last week
- natural language processor powered by plugins part of the @unifiedjs collective☆2,434Feb 4, 2025Updated last year
- The local-first database that runs on every JS runtime and replicates with your existing backend - no vendor, no lock-in - https://rxdb.i…☆23,243Updated this week
- Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxt…☆3,595Jun 18, 2026Updated 2 weeks ago
- The JavaScript Database, for Node.js, nw.js, electron and the browser☆13,540May 15, 2025Updated last year
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,794Jun 18, 2026Updated 2 weeks ago
- JavaScript API for Chrome and Firefox☆95,258Updated this week
- Functional, composable, immutable & curried promise sequences with abstract resolution.☆109Jun 1, 2020Updated 6 years ago
- A high-level browser automation library.☆19,776Apr 20, 2024Updated 2 years ago
- High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.☆32,410Updated this week
- Drag and drop so simple it hurts☆22,155Jun 7, 2024Updated 2 years ago
- Web scraper for NodeJS☆4,110Dec 13, 2023Updated 2 years ago
- Node.js test runner that lets you develop with confidence 🚀☆20,845Jun 17, 2026Updated 2 weeks ago
- A full-featured framework for building command line applications (cli) with node.js☆3,449Jan 3, 2024Updated 2 years ago
- The fast, flexible, and elegant library for parsing and manipulating HTML and XML.☆30,398Updated this week
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Nov 16, 2015Updated 10 years ago