modesty / pdf2json
converts binary PDF to JSON and text, for server-side PDF processing and command-line use.
☆2,071Updated 2 months ago
Alternatives and similar repositories for pdf2json:
Users that are interested in pdf2json are comparing it to the libraries listed below
- 🚜 Parse text and tables from PDF files.☆669Updated 2 months ago
- Node.js module for high performance creation, modification and parsing of PDF files and streams☆1,154Updated last month
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,662Updated 2 years ago
- Node PDF Extract☆389Updated last year
- A utility for converting pdf to image and base64 format.☆460Updated last month
- A Portable Document Format (PDF) generation library targeting both the server- and client-side.☆788Updated last year
- A powerful PDF tool for NodeJS based on HummusJS.☆346Updated last year
- a javascript docx parser☆376Updated last month
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆164Updated last month
- Converts HTML documents to DOCX in the browser☆1,086Updated 3 years ago
- nodejs lib for extracting data from PDF files☆226Updated 11 months ago
- A JavaScript PDF generation library for Node and the browser☆10,150Updated this week
- Generate docx, pptx, and xlsx from templates (Word, Powerpoint and Excel documents), from Node.js or the browser. Demo: https://www.docxt…☆3,245Updated last week
- ViewerJS: Document Reader in JavaScript☆1,966Updated 2 years ago
- Standalone Office Open XML files (Microsoft Office 2007 and later) generator for Word (docx), PowerPoint (pptx) and Excell (xlsx) in java…☆2,680Updated 11 months ago
- Template-based docx report creation☆963Updated 2 weeks ago
- Advanced html to text converter☆1,648Updated last year
- Generic build of PDF.js library.☆1,243Updated 8 months ago
- Detect the file type of a file, stream, or data☆3,924Updated 2 weeks ago
- A wrapper for the wkhtmltopdf HTML to PDF converter using WebKit☆610Updated 2 years ago
- Annotation layer for pdf.js (no longer maintained)☆552Updated 6 years ago
- a streaming interface for archive generation☆2,862Updated 3 weeks ago
- PDF manipulation in Node.js! Split, join, crop, read, extract, boil, mash, stick them in a stew.☆286Updated last month
- Based on lunr.js, but more flexible and customized.☆2,058Updated 2 years ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆210Updated last week
- A Node.js wrapper for the Tesseract OCR API☆310Updated last year
- HTML to DOCX converter☆425Updated 7 months ago
- PDF to HTML (pdf2htmlEX) shell wrapper pdftohtmljs☆145Updated 2 years ago
- A persistent, network resilient, full text search library for the browser and Node.js☆1,408Updated 2 weeks ago
- Javascript Library parsing Excel Formulas and more☆642Updated 3 years ago