shebinleo / pdf2html
pdf2html is a module that converts PDF files to HTML pages using Apache Tika. It can also generate thumbnail images for PDF files using Apache PDFBox.
☆200 Updated last month
Alternatives and similar repositories for pdf2html
Users interested in pdf2html are comparing it to the libraries listed below.
- nodejs lib for extracting data from PDF files ☆246 Updated 5 months ago
- HTML to DOCX converter ☆476 Updated 9 months ago
- A robust, strictly-typed Node.js and Browser library for parsing office files (docx, pptx, xlsx, odt, odp, ods, pdf, rtf). It produces a … ☆262 Updated this week
- 📰 Yet another Webassembly PDF renderer for node and the browser ☆212 Updated last year
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata ☆106 Updated 2 years ago
- Simple node package to convert a PDF into images. ☆198 Updated last year
- ☆303 Updated last month
- Yet another library to extract text from MS Office and PDF files ☆84 Updated 2 weeks ago
- Generate PPTX files on the server-side with JavaScript. ☆187 Updated last month
- Typescript wrapper for the PDFium library, works in browser and node.js ☆149 Updated this week
- Annotation layer for pdf.js ☆291 Updated last month
- Get text content from any file ☆64 Updated last year
- Create PowerPoint presentations with React ☆198 Updated 10 months ago
- A utility for converting pdf to image and base64 format. ☆496 Updated 7 months ago
- 🚜 Parse text and tables from PDF files. ☆697 Updated last month
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs) ☆292 Updated this week
- a javascript docx parser ☆399 Updated 11 months ago
- WebViewer UI built in React ☆460 Updated last week
- 📃📸 Converts PDFs to images in nodejs ☆135 Updated 4 months ago
- QuillJS Editor Plugin for advanced Markdown ☆203 Updated last year
- Export a prosemirror document to a Microsoft Word file, using docx. ☆154 Updated 6 months ago
- Parser to convert PPTX to JSON format ☆93 Updated 3 years ago
- A synchronous zip module ☆58 Updated 2 weeks ago
- Simple tool for converting PDF to text using OCR ☆98 Updated 2 years ago
- Render RTF documents in HTML. ☆161 Updated 2 years ago
- Library Convert PDF to PNG ☆167 Updated 2 weeks ago
- ☆302 Updated 11 months ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library ☆235 Updated this week
- Emscripten port of Tesseract C++ API ☆183 Updated 2 weeks ago
- Pure Javascript reader/writer for PowerPoint ☆151 Updated 10 years ago