dbashford / textract
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
☆1,636Updated last year
Related projects: ⓘ
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,648Updated 4 months ago
- Download and extract files☆1,280Updated 11 months ago
- Advanced html to text converter☆1,587Updated 10 months ago
- A javascript library for defining recurring schedules and calculating future (or past) occurrences for them. Includes support for using …☆2,417Updated 6 years ago
- Node.js module for high performance creation, modification and parsing of PDF files and streams☆1,144Updated last month
- CSV parser and formatter for node☆1,639Updated this week
- Full featured CSV parser with simple api and tested against large datasets.☆3,978Updated 3 weeks ago
- rawStream.pipe(JSONStream.parse()).pipe(streamOfObjects)☆1,912Updated 5 years ago
- NodeJS excel file parser & builder☆2,951Updated 2 months ago
- a streaming interface for archive generation☆2,800Updated 2 weeks ago
- Access control lists for node applications☆2,618Updated last year
- Streaming csv parser inspired by binary-csv that aims to be faster than everyone else☆1,413Updated 7 months ago
- converts binary PDF to JSON and text, for server-side PDF processing and command-line use.☆1,978Updated last month
- Distribute processing tasks to child processes with an über-simple API and baked-in durability & custom concurrency options.☆1,747Updated 2 years ago
- Flexible ascii progress bar for nodejs☆2,975Updated last year
- Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.☆2,523Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective☆2,356Updated 4 months ago
- A Javascript implementation of zip for nodejs. Allows user to create or extract zip files both in memory or to/from disk☆2,023Updated 2 weeks ago
- Proxy middleware for express/connect☆1,224Updated 3 weeks ago
- Easy website screenshots in Node.js☆2,124Updated 5 years ago
- Abstraction for exponential and custom retry strategies for failed operations.☆1,217Updated last year
- Node module to allow for easy Excel file creation☆1,377Updated 2 years ago
- 🚜 Parse text and tables from PDF files.☆620Updated last month
- Node module for detecting image dimensions☆2,024Updated 2 months ago
- Nimble, streamable HTTP client for Node.js. With proxy, iconv, cookie, deflate & multipart support.☆1,624Updated 8 months ago
- a javascript docx parser☆355Updated last week
- JSON Schema validation☆1,820Updated last month
- A module to create readable `"multipart/form-data"` streams. Can be used to submit forms and file uploads to other web applications.☆2,273Updated 2 months ago
- ☆1,913Updated this week
- Pretty unicode tables for the CLI with Node.JS☆2,276Updated last month