node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,692Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for textract
Users that are interested in textract are comparing it to the libraries listed below
Sorting:
- Automatically extract body content (and other cool stuff) from an html document☆2,164May 26, 2023Updated 2 years ago
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.☆2,192Updated this week
- The next web scraper. See through the <html> noise.☆5,906Feb 16, 2026Updated 2 weeks ago
- general natural language facilities for node☆10,871Feb 22, 2026Updated last week
- An image processing library written entirely in JavaScript for Node, with zero external or native dependencies.☆14,590Nov 27, 2025Updated 3 months ago
- Pure Javascript OCR for more than 100 Languages 📖🎉🖥☆37,874Updated this week
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,717Apr 30, 2024Updated last year
- Parse office documents (doc, docx, xls, etc..)☆182Apr 14, 2014Updated 11 years ago
- Asynchronous HTTP microservices☆10,615Jun 19, 2024Updated last year
- Distributed, realtime CLI for live Node apps.☆3,429Aug 27, 2021Updated 4 years ago
- Debug Node.js code with Chrome Developer Tools.☆2,323Nov 10, 2022Updated 3 years ago
- modest natural-language processing☆12,040Feb 23, 2026Updated last week
- Node's framework for interactive CLIs☆5,639Sep 19, 2023Updated 2 years ago
- Convert Word documents (.docx files) to HTML☆6,109Nov 20, 2025Updated 3 months ago
- Package your Node.js project into an executable☆24,419Jan 3, 2024Updated 2 years ago
- Premium Queue package for handling distributed jobs and messages in NodeJS.☆16,235Updated this week
- Node.js Desktop Automation.☆12,708Jun 21, 2024Updated last year
- The world's most versatile desktop notifications framework☆8,685Dec 15, 2023Updated 2 years ago
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆23,058Updated this week
- Command Line UI toolkit for Node.js☆1,667Sep 8, 2020Updated 5 years ago
- Kue is a priority job queue backed by redis, built for node.js.☆9,468Feb 12, 2024Updated 2 years ago
- The fastest way to build beautiful Electron apps using simple HTML and CSS☆10,077Dec 21, 2023Updated 2 years ago
- The JavaScript Database, for Node.js, nw.js, electron and the browser☆13,567May 15, 2025Updated 9 months ago
- A high-level browser automation library.☆19,971Apr 20, 2024Updated last year
- Official Elasticsearch client library for Node.js☆5,304Updated this week
- Web scraper for NodeJS☆4,115Dec 13, 2023Updated 2 years ago
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,788May 28, 2025Updated 9 months ago
- Drag and drop so simple it hurts☆22,192Jun 7, 2024Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective☆2,432Feb 4, 2025Updated last year
- React UI Components for macOS High Sierra and Windows 10☆9,509Jul 1, 2023Updated 2 years ago
- JavaScript API for Chrome and Firefox☆93,685Updated this week
- High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.☆31,949Updated this week
- 🌐 Human-friendly and powerful HTTP request library for Node.js☆14,871Updated this week
- A client server module which acts like SSH but communicates via socket.io.☆31Oct 8, 2016Updated 9 years ago
- A full-featured framework for building command line applications (cli) with node.js☆3,462Jan 3, 2024Updated 2 years ago
- Node.js test runner that lets you develop with confidence 🚀☆20,853Updated this week
- Format input text content when you are typing...☆17,918Nov 25, 2023Updated 2 years ago
- Universal scraping tool, which allows you to extract data using multiple environments☆229Apr 17, 2019Updated 6 years ago
- API Services Made Easy With Node.js☆4,487Jan 16, 2023Updated 3 years ago