node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,693Dec 15, 2025Updated 3 months ago
Alternatives and similar repositories for textract
Users that are interested in textract are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,714Apr 30, 2024Updated last year
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.☆2,197Mar 15, 2026Updated last week
- Parse office documents (doc, docx, xls, etc..)☆183Apr 14, 2014Updated 11 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,164May 26, 2023Updated 2 years ago
- general natural language facilities for node☆10,873Feb 22, 2026Updated last month
- The next web scraper. See through the <html> noise.☆5,906Feb 16, 2026Updated last month
- An image processing library written entirely in JavaScript for Node, with zero external or native dependencies.☆14,597Nov 27, 2025Updated 3 months ago
- Convert Word documents (.docx files) to HTML☆6,145Mar 13, 2026Updated last week
- Pure Javascript OCR for more than 100 Languages 📖🎉🖥☆37,933Feb 28, 2026Updated 3 weeks ago
- Distributed, realtime CLI for live Node apps.☆3,426Aug 27, 2021Updated 4 years ago
- modest natural-language processing☆12,052Feb 25, 2026Updated 3 weeks ago
- Premium Queue package for handling distributed jobs and messages in NodeJS.☆16,251Updated this week
- Node's framework for interactive CLIs☆5,636Sep 19, 2023Updated 2 years ago
- Debug Node.js code with Chrome Developer Tools.☆2,323Nov 10, 2022Updated 3 years ago
- Package your Node.js project into an executable☆24,414Jan 3, 2024Updated 2 years ago
- Kue is a priority job queue backed by redis, built for node.js.☆9,469Feb 12, 2024Updated 2 years ago
- Asynchronous HTTP microservices☆10,614Jun 19, 2024Updated last year
- The fastest way to build beautiful Electron apps using simple HTML and CSS☆10,078Dec 21, 2023Updated 2 years ago
- Command Line UI toolkit for Node.js☆1,666Sep 8, 2020Updated 5 years ago
- 📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs☆36,216Apr 18, 2024Updated last year
- Node.js Desktop Automation.☆12,720Updated this week
- React UI Components for macOS High Sierra and Windows 10☆9,501Jul 1, 2023Updated 2 years ago
- The world's most versatile desktop notifications framework☆8,681Dec 15, 2023Updated 2 years ago
- Official Elasticsearch client library for Node.js☆5,301Mar 16, 2026Updated last week
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆23,083Updated this week
- Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxt…☆3,535Mar 1, 2026Updated 3 weeks ago
- The JavaScript Database, for Node.js, nw.js, electron and the browser☆13,564May 15, 2025Updated 10 months ago
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,790May 28, 2025Updated 9 months ago
- JavaScript API for Chrome and Firefox☆93,892Updated this week
- High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.☆32,022Mar 12, 2026Updated last week
- A high-level browser automation library.☆19,963Apr 20, 2024Updated last year
- Functional, composable, immutable & curried promise sequences with abstract resolution.☆110Jun 1, 2020Updated 5 years ago
- Drag and drop so simple it hurts☆22,190Jun 7, 2024Updated last year
- Web scraper for NodeJS☆4,117Dec 13, 2023Updated 2 years ago
- Node.js test runner that lets you develop with confidence 🚀☆20,855Mar 3, 2026Updated 2 weeks ago
- A full-featured framework for building command line applications (cli) with node.js☆3,461Jan 3, 2024Updated 2 years ago
- The fast, flexible, and elegant library for parsing and manipulating HTML and XML.☆30,195Updated this week
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Nov 16, 2015Updated 10 years ago
- 🌐 Human-friendly and powerful HTTP request library for Node.js☆14,874Feb 28, 2026Updated 3 weeks ago