ffalt / pdf.js-extract
nodejs lib for extracting data from PDF files
☆226Updated 11 months ago
Alternatives and similar repositories for pdf.js-extract:
Users that are interested in pdf.js-extract are comparing it to the libraries listed below
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs)☆253Updated 3 months ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆209Updated this week
- 🚜 Parse text and tables from PDF files.☆668Updated 2 months ago
- Simple node package to convert a PDF into images.☆190Updated 5 months ago
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆97Updated last year
- A utility for converting pdf to image and base64 format.☆460Updated last month
- ☆276Updated last month
- A powerful PDF tool for NodeJS based on HummusJS.☆346Updated last year
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆163Updated last month
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagick☆236Updated 5 years ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆189Updated 9 months ago
- ☆295Updated last month
- Library Convert PDF to PNG☆142Updated 3 weeks ago
- Turns XLSX into a readable stream.☆171Updated 7 months ago
- Read data from a Word document using node.js☆141Updated 9 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆180Updated 4 months ago
- IMAP Client library for EmailEngine Email API (https://emailengine.app)☆408Updated 2 weeks ago
- Run Pandoc from NodeJS. Pandoc installation is required.☆79Updated 8 years ago
- a javascript docx parser☆376Updated last month
- 📃📸 Converts PDFs to images in nodejs☆98Updated last month
- A wrapper for PDF Toolkit with streams and promises.☆141Updated 11 months ago
- Short Unique ID (UUID) generation library. Available in NPM.☆412Updated 2 weeks ago
- A Node.js wrapper for the Tesseract OCR API☆310Updated last year
- Node.js - Convert DOCX to PDF, PNG to PDF, get thumbnails for PDF, stream PDFs.☆80Updated 2 years ago
- Lightweight string similarity function for javascript☆98Updated last year
- Yet another library to extract text from MS Office and PDF files☆72Updated 8 months ago
- Node.js module for high performance creation, modification and parsing of PDF files and streams☆1,153Updated last month
- Generate docx documents from templates, in Node or in the browser.☆428Updated this week
- Annotation layer for pdf.js☆278Updated 6 months ago
- Get text content from any file☆65Updated 7 months ago