shebinleo / pdf2html
pdf2html is a module that converts PDF files to HTML pages using Apache Tika. It can also generate thumbnail images for PDF files using Apache PDFBox.
☆193Updated last month
Alternatives and similar repositories for pdf2html
Users who are interested in pdf2html are comparing it to the libraries listed below.
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx, odt, odp, and ods.☆235Updated 2 months ago
- nodejs lib for extracting data from PDF files☆241Updated last month
- Generate PPTX files on the server-side with JavaScript.☆180Updated last year
- Yet another library to extract text from MS Office and PDF files☆81Updated last year
- ☆296Updated this week
- A utility for converting pdf to image and base64 format.☆481Updated 3 months ago
- Simple node package to convert a PDF into images.☆197Updated 10 months ago
- HTML to DOCX converter☆472Updated 5 months ago
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆100Updated 2 years ago
- Library to convert PDF to PNG☆157Updated last month
- Create PowerPoint presentations with React☆174Updated 6 months ago
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆228Updated last week
- Get text content from any file☆64Updated last year
- Muhammara is a Node module with C/C++ bindings to modify PDFs with JS for Node or Electron (based on, and a replacement for, galkhana/hummusjs)☆275Updated last month
- a javascript docx parser☆387Updated 7 months ago
- Export a prosemirror document to a Microsoft Word file, using docx.☆144Updated 2 months ago
- Pure Javascript reader/writer for PowerPoint☆148Updated 9 years ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆204Updated last year
- Parser to convert PPTX to JSON format☆90Updated 2 years ago
- 🚜 Parse text and tables from PDF files.☆691Updated 7 months ago
- 📃📸 Converts PDFs to images in nodejs☆116Updated last week
- QuillJS Editor Plugin for advanced Markdown☆196Updated last year
- A lightweight Typescript library that interacts with Gotenberg's different modules to convert a variety of document formats to PDF files.☆125Updated this week
- Annotation layer for pdf.js☆288Updated 11 months ago
- Read data from a Word document using node.js☆145Updated last year
- Add image annotation to your web apps.☆154Updated last month
- Typescript wrapper for the PDFium library, works in browser and node.js☆121Updated last month
- React component for ONLYOFFICE Document Server☆47Updated this week
- JavaScript OCR and text extraction for images and PDFs.☆168Updated this week
- A synchronous zip module☆57Updated 4 months ago