node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,695Dec 15, 2025Updated 4 months ago
Alternatives and similar repositories for textract
Users that are interested in textract are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,713Apr 30, 2024Updated 2 years ago
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.☆2,202Apr 16, 2026Updated 2 weeks ago
- Parse office documents (doc, docx, xls, etc..)☆183Apr 14, 2014Updated 12 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,163May 26, 2023Updated 2 years ago
- general natural language facilities for node☆10,874Feb 22, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The next web scraper. See through the <html> noise.☆5,905Feb 16, 2026Updated 2 months ago
- An image processing library written entirely in JavaScript for Node, with zero external or native dependencies.☆14,603Apr 7, 2026Updated 3 weeks ago
- Convert Word documents (.docx files) to HTML☆6,188Mar 13, 2026Updated last month
- Pure Javascript OCR for more than 100 Languages 📖🎉🖥☆38,039Feb 28, 2026Updated 2 months ago
- Distributed, realtime CLI for live Node apps.☆3,423Aug 27, 2021Updated 4 years ago
- modest natural-language processing☆12,080Feb 25, 2026Updated 2 months ago
- Premium Queue package for handling distributed jobs and messages in NodeJS.☆16,241Apr 23, 2026Updated last week
- Node's framework for interactive CLIs☆5,634Sep 19, 2023Updated 2 years ago
- Debug Node.js code with Chrome Developer Tools.☆2,321Nov 10, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Package your Node.js project into an executable☆24,386Jan 3, 2024Updated 2 years ago
- Kue is a priority job queue backed by redis, built for node.js.☆9,450Feb 12, 2024Updated 2 years ago
- Asynchronous HTTP microservices☆10,613Jun 19, 2024Updated last year
- The fastest way to build beautiful Electron apps using simple HTML and CSS☆10,082Apr 3, 2026Updated 3 weeks ago
- Command Line UI toolkit for Node.js☆1,662Sep 8, 2020Updated 5 years ago
- 📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs☆36,242Apr 18, 2024Updated 2 years ago
- Node.js Desktop Automation.☆12,728Apr 15, 2026Updated 2 weeks ago
- React UI Components for macOS High Sierra and Windows 10☆9,497Jul 1, 2023Updated 2 years ago
- The world's most versatile desktop notifications framework☆8,673Dec 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Elasticsearch client library for Node.js☆5,299Updated this week
- A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/☆23,162Updated this week
- Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxt…☆3,560Updated this week
- The JavaScript Database, for Node.js, nw.js, electron and the browser☆13,556May 15, 2025Updated 11 months ago
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,790May 28, 2025Updated 11 months ago
- JavaScript API for Chrome and Firefox☆94,231Updated this week
- High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.☆32,176Updated this week
- Functional, composable, immutable & curried promise sequences with abstract resolution.☆110Jun 1, 2020Updated 5 years ago
- A high-level browser automation library.☆19,805Apr 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Drag and drop so simple it hurts☆22,184Jun 7, 2024Updated last year
- Web scraper for NodeJS☆4,114Dec 13, 2023Updated 2 years ago
- Node.js test runner that lets you develop with confidence 🚀☆20,849Updated this week
- A full-featured framework for building command line applications (cli) with node.js☆3,453Jan 3, 2024Updated 2 years ago
- The fast, flexible, and elegant library for parsing and manipulating HTML and XML.☆30,297Updated this week
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Nov 16, 2015Updated 10 years ago
- 🌐 Human-friendly and powerful HTTP request library for Node.js☆14,899Apr 21, 2026Updated last week