modesty / pdf2jsonLinks
converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.
☆2,097Updated last week
Alternatives and similar repositories for pdf2json
Users that are interested in pdf2json are comparing it to the libraries listed below
Sorting:
- Node.js module for high performance creation, modification and parsing of PDF files and streams☆1,164Updated 3 months ago
- 🚜 Parse text and tables from PDF files.☆675Updated 4 months ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,668Updated 2 years ago
- Node PDF Extract☆390Updated last year
- Advanced html to text converter☆1,654Updated last year
- A powerful PDF tool for NodeJS based on HummusJS.☆346Updated 2 years ago
- nodejs lib for extracting data from PDF files☆232Updated last year
- HTML to PDF or image (jpeg, png, webp) converter via Chrome/Chromium☆787Updated this week
- A utility for converting pdf to image and base64 format.☆467Updated this week
- A simple wrapper for the Tesseract OCR package☆673Updated 4 years ago
- A JavaScript PDF generation library for Node and the browser☆10,241Updated last month
- A node.js library for processing and understanding scanned documents☆341Updated 2 years ago
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,686Updated last year
- Decode mime formatted e-mails☆1,621Updated 2 weeks ago
- Converts HTML documents to DOCX in the browser☆1,103Updated 3 years ago
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagick☆235Updated 5 years ago
- Download and extract files☆1,294Updated last year
- The fast & forgiving HTML and XML parser☆4,582Updated this week
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆171Updated this week
- PDF manipulation in Node.js! Split, join, crop, read, extract, boil, mash, stick them in a stew.☆286Updated 3 months ago
- This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.☆3,564Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective☆2,404Updated 3 months ago
- Javascript library for creating annotations in PDF documents☆592Updated 2 years ago
- Convert json to csv with column titles☆2,727Updated 2 years ago
- Javascript utility for calculating deep difference, capturing changes, and applying changes across objects; for nodejs and the browser.☆3,033Updated last year
- Create custom SMTP servers on the fly☆883Updated 2 weeks ago
- A NodeJS module to generate Excel files in .xlsx format from a template created with Excel itself☆415Updated last month
- HTML to DOCX converter☆436Updated last month
- Convert Word documents (.docx files) to HTML☆5,438Updated this week
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆99Updated 2 years ago