dbashford / textractLinks
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,691Updated last month
Alternatives and similar repositories for textract
Users that are interested in textract are comparing it to the libraries listed below
Sorting:
- Node.js module for high performance creation, modification and parsing of PDF files and streams☆1,173Updated this week
- Advanced html to text converter☆1,686Updated 2 years ago
- A wrapper for the wkhtmltopdf HTML to PDF converter using WebKit☆614Updated 2 years ago
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.☆2,185Updated last week
- 🚜 Parse text and tables from PDF files.☆698Updated last week
- Node module that summarizes text using a naive summarization algorithm☆770Updated this week
- This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.☆3,658Updated last year
- A persistent, network resilient, full text search library for the browser and Node.js☆1,424Updated 9 months ago
- Download and extract files☆1,304Updated 2 years ago
- A javascript library for defining recurring schedules and calculating future (or past) occurrences for them. Includes support for using …☆2,420Updated 7 years ago
- A simple wrapper for the Tesseract OCR package☆677Updated 5 years ago
- CSV parser and formatter for node☆1,770Updated this week
- PDF manipulation in Node.js! Split, join, crop, read, extract, boil, mash, stick them in a stew.☆288Updated 11 months ago
- natural language processor powered by plugins part of the @unifiedjs collective☆2,429Updated 11 months ago
- Date() for humans☆1,482Updated 3 years ago
- An XML builder for node.js☆923Updated last month
- Easy website screenshots in Node.js☆2,119Updated 6 years ago
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,714Updated last year
- Nimble, streamable HTTP client for Node.js. With proxy, iconv, cookie, deflate & multipart support.☆1,635Updated 2 months ago
- Streaming csv parser inspired by binary-csv that aims to be faster than everyone else☆1,489Updated last year
- Decode mime formatted e-mails☆1,655Updated 3 weeks ago
- A powerful PDF tool for NodeJS based on HummusJS.☆350Updated 2 years ago
- Machine-learning for Node.js☆1,053Updated last week
- Node module to allow for easy Excel file creation☆1,369Updated 3 years ago
- A node.js library for processing and understanding scanned documents☆340Updated 3 years ago
- Flexible event driven crawler for node.☆2,136Updated 4 years ago
- 📄 A command line tool to generate PDF from URL, HTML or Markdown files.☆1,262Updated 6 months ago
- Agenda Dashboard☆812Updated last year
- A search server that can be installed with npm☆658Updated 5 months ago
- Unirest in Node.js: Simplified, lightweight HTTP client library.☆958Updated 9 months ago