ScientaNL / pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
â97Updated last year
Alternatives and similar repositories for pdf-extractor:
Users that are interested in pdf-extractor are comparing it to the libraries listed below
- nodejs lib for extracting data from PDF filesâ226Updated 11 months ago
- đ° Yet another Webassembly PDF renderer for node and the browserâ190Updated 9 months ago
- A wrapper for PDF Toolkit with streams and promises.â141Updated last year
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image âŚâ163Updated last month
- a javascript docx parserâ376Updated last month
- Annotation layer for pdf.jsâ279Updated 6 months ago
- Get text content from any fileâ65Updated 7 months ago
- Pure Javascript reader/writer for PowerPointâ141Updated 9 years ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering libraryâ211Updated last week
- Convert PDF files into images using Poppler with promises. It achieves 10x faster performance compared to other PDF converters.â54Updated 3 years ago
- Extracts email address from an arbitrary text input.â62Updated 2 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..â183Updated 4 months ago
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagickâ235Updated 5 years ago
- Extract text from pdfs that contain searchable pdf textâ116Updated 6 years ago
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasmâ53Updated 6 years ago
- Parser to convert PPTX to JSON formatâ89Updated 2 years ago
- đ Node.js wrapper for pdftocairo - PDF to PNG/JPEG/TIFF/PDF/PS/EPS/SVG using cairoâ26Updated last year
- Simple node package to convert a PDF into images.â190Updated 5 months ago
- NodeJS Readium2 "streamer"â21Updated 2 months ago
- â278Updated last month
- Read data from a Word document using node.jsâ141Updated 9 months ago
- React component for ONLYOFFICE Document Serverâ42Updated last week
- Annotation layer for PDF.js. Forked and modified from Submitty's branch.â15Updated 2 years ago
- Building PDFium for Web Assemblyâ74Updated 4 years ago
- Wrapper for PDF JS to add annotationsâ359Updated 2 years ago
- Generates a printable paginated pdf from DOM node using HTML5 canvas and svgâ147Updated 9 months ago
- PDF to HTML (pdf2htmlEX) shell wrapper pdftohtmljsâ145Updated 2 years ago
- RFC 822 EML file format parser and builderâ92Updated last year
- Javascript library for creating and manipulating Open XML Documents like docx, xlsx, etc. User can export grid data or images to open xmlâŚâ29Updated 2 years ago
- Convert html to rtf format in the serverâ40Updated last year