dbashford / textract
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,686 · Updated 2 years ago
Alternatives and similar repositories for textract
Users who are interested in textract are comparing it to the libraries listed below.
- Node.js module for high performance creation, modification and parsing of PDF files and streams · ☆1,168 · Updated last month
- Advanced html to text converter · ☆1,668 · Updated last year
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency. · ☆2,136 · Updated 2 weeks ago
- Decode mime formatted e-mails · ☆1,638 · Updated last month
- A persistent, network resilient, full text search library for the browser and Node.js · ☆1,418 · Updated 5 months ago
- 🚜 Parse text and tables from PDF files. · ☆691 · Updated 7 months ago
- A javascript library for defining recurring schedules and calculating future (or past) occurrences for them. Includes support for using … · ☆2,418 · Updated 7 years ago
- Node module that summarizes text using a naive summarization algorithm · ☆770 · Updated 10 months ago
- Node PDF Extract · ☆389 · Updated 2 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English. · ☆344 · Updated 7 years ago
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excel (xlsx) in java… · ☆2,702 · Updated last year
- Download and extract files · ☆1,300 · Updated last year
- Automatically extract body content (and other cool stuff) from an html document · ☆2,158 · Updated 2 years ago
- Node module to allow for easy Excel file creation · ☆1,373 · Updated 3 years ago
- A powerful PDF tool for NodeJS based on HummusJS. · ☆349 · Updated 2 years ago
- PDF manipulation in Node.js! Split, join, crop, read, extract, boil, mash, stick them in a stew. · ☆287 · Updated 6 months ago
- An IMAP client module for node.js. · ☆2,206 · Updated last year
- Easy website screenshots in Node.js · ☆2,117 · Updated 6 years ago
- ImageMagick's Magick++ bindings for NodeJS · ☆631 · Updated 4 years ago
- Run PhantomJS from Node · ☆1,453 · Updated 5 years ago
- Nimble, streamable HTTP client for Node.js. With proxy, iconv, cookie, deflate & multipart support. · ☆1,640 · Updated last year
- A simple wrapper for the Tesseract OCR package · ☆676 · Updated 5 years ago
- Distribute processing tasks to child processes with an über-simple API and baked-in durability & custom concurrency options. · ☆1,742 · Updated 3 years ago
- CSV parser and formatter for node · ☆1,748 · Updated last week
- A search server that can be installed with npm · ☆656 · Updated 3 weeks ago
- Natural language detection · ☆4,308 · Updated last year
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. · ☆717 · Updated last year
- A node module for Google's Universal Analytics and Measurement Protocol · ☆965 · Updated 2 years ago
- Word Processing Document Library · ☆1,314 · Updated 3 years ago
- Native NodeJS implementation of MaxMind's GeoIP API -- works in node 0.6.3 and above, ask me about other versions · ☆2,380 · Updated last year