shebinleo / pdf2html
pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.
☆163Updated last month
Alternatives and similar repositories for pdf2html:
Users that are interested in pdf2html are comparing it to the libraries listed below
- nodejs lib for extracting data from PDF files☆226Updated 11 months ago
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆97Updated last year
- Yet another library to extract text from MS Office and PDF files☆74Updated 8 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆185Updated 5 months ago
- ☆277Updated last month
- React component for ONLYOFFICE Document Server☆42Updated 2 weeks ago
- Get text content from any file☆65Updated 7 months ago
- Fabric.js history plugin☆188Updated 9 months ago
- HTML to DOCX converter☆426Updated this week
- Documentation for Mozilla's PDF.js library☆59Updated 11 months ago
- Connect HTML elements with an arrow☆77Updated last year
- Pure Javascript reader/writer for PowerPoint☆142Updated 9 years ago
- A simple 🖼️ to 📄 converter for Node.js☆29Updated 3 months ago
- Create PowerPoint presentations with React☆149Updated last month
- Annotation layer for pdf.js☆280Updated 6 months ago
- a javascript docx parser☆376Updated 2 months ago
- Lightweight string similarity function for javascript☆100Updated last year
- A synchronous zip module☆50Updated last week
- Simple node package to convert a PDF into images.☆190Updated 5 months ago
- svg drawing library.☆92Updated this week
- WebViewer UI built in React☆426Updated this week
- Pdf editor react component☆96Updated 2 years ago
- docx to html converter☆15Updated 11 years ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆190Updated 9 months ago
- Node.js - Convert DOCX to PDF, PNG to PDF, get thumbnails for PDF, stream PDFs.☆80Updated 2 years ago
- Split {Japanese, English} text into sentences.☆124Updated last year
- Export your HTML canvas to PDF☆225Updated last year
- Asynchronous Node.js wrapper for the Poppler PDF rendering library☆211Updated last week
- Parser to convert PPTX to JSON format☆89Updated 2 years ago
- Convert PDF files into images using Poppler with promises. It achieves 10x faster performance compared to other PDF converters.☆55Updated 3 years ago