ffalt / pdf.js-extract
nodejs lib for extracting data from PDF files
☆227Updated last year
Alternatives and similar repositories for pdf.js-extract:
Users that are interested in pdf.js-extract are comparing it to the libraries listed below
- 🚜 Parse text and tables from PDF files.☆674Updated 3 months ago
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs)☆258Updated 4 months ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆211Updated this week
- Simple node package to convert a PDF into images.☆194Updated 6 months ago
- A Node.js wrapper for the Tesseract OCR API☆311Updated last year
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆167Updated 2 weeks ago
- 📃📸 Converts PDFs to images in nodejs☆98Updated 2 months ago
- A utility for converting pdf to image and base64 format.☆464Updated 2 months ago
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆97Updated last year
- Yet another library to extract text from MS Office and PDF files☆76Updated 9 months ago
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagick☆235Updated 5 years ago
- ☆279Updated 2 months ago
- A powerful PDF tool for NodeJS based on HummusJS.☆346Updated 2 years ago
- a javascript docx parser☆377Updated 2 months ago
- Library Convert PDF to PNG☆144Updated 2 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆191Updated 5 months ago
- ☆294Updated 2 months ago
- A module for node.js and the browser that takes in text and strips it of stopwords☆247Updated 4 months ago
- A wrapper for PDF Toolkit with streams and promises.☆141Updated last year
- IMAP Client library for EmailEngine Email API (https://emailengine.app)☆418Updated 2 weeks ago
- In-memory Node.js and browser job scheduler☆589Updated 11 months ago
- Get text content from any file☆65Updated 8 months ago
- Helps to draw informations in simple tables using pdfkit. #server-side. Generate pdf tables with javascript (PDFKIT plugin)☆99Updated last year
- NPM package for creating a keyword array from a string and excluding stop words.☆200Updated 10 months ago
- Character encoding detection tool for NodeJS☆290Updated 2 months ago
- Short Unique ID (UUID) generation library. Available in NPM.☆421Updated last month
- Read data from a Word document using node.js☆142Updated 10 months ago
- Flexible conversion between JSON and CSV☆329Updated 3 months ago
- Generate PPTX files on the server-side with JavaScript.☆173Updated last year
- Generate docx documents from templates, in Node or in the browser.☆435Updated this week