ScientaNL / pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
☆97Updated last year
Alternatives and similar repositories for pdf-extractor:
Users that are interested in pdf-extractor are comparing it to the libraries listed below
- nodejs lib for extracting data from PDF files☆227Updated last year
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm☆54Updated 6 years ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆191Updated 5 months ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆211Updated this week
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs)☆258Updated 4 months ago
- Annotation layer for pdf.js☆281Updated 7 months ago
- Parser to convert PPTX to JSON format☆89Updated 2 years ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆166Updated 2 weeks ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆190Updated 10 months ago
- A wrapper for PDF Toolkit with streams and promises.☆141Updated last year
- Get text content from any file☆65Updated 8 months ago
- Generates a printable paginated pdf from DOM node using HTML5 canvas and svg☆147Updated 10 months ago
- Generate PPTX files on the server-side with JavaScript.☆173Updated last year
- a javascript docx parser☆377Updated 2 months ago
- Simple node package to convert a PDF into images.☆193Updated 6 months ago
- A powerful PDF tool for NodeJS based on HummusJS.☆346Updated 2 years ago
- Image annotation block for Airtable☆46Updated 4 years ago
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagick☆235Updated 5 years ago
- A utility for converting pdf to image and base64 format.☆464Updated 2 months ago
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆67Updated last year
- Node module wrapper for WordNet dictionary.☆54Updated 3 years ago
- ☆279Updated 2 months ago
- ☆187Updated 4 years ago
- 🚜 Parse text and tables from PDF files.☆675Updated 3 months ago
- A high-performance in-memory convertor to convert svg to png/jpeg images for Node.☆165Updated last year
- Javascript library for creating annotations in PDF documents☆585Updated 2 years ago
- Extracts email address from an arbitrary text input.☆62Updated 3 months ago
- Pure Javascript reader/writer for PowerPoint☆143Updated 9 years ago
- RFC 822 EML file format parser and builder☆92Updated 2 years ago
- Foxit webpdf.js provides a world-class JavaScript library for viewing PDF files in web browsers.☆64Updated 4 years ago