ScientaNL / pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
☆90 Updated last year
Related projects
Alternatives and complementary repositories for pdf-extractor
- nodejs lib for extracting data from PDF files ☆213 Updated 6 months ago
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm ☆52 Updated 5 years ago
- Annotation layer for pdf.js ☆268 Updated last month
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆154 Updated 5 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods.. ☆148 Updated last week
- 📰 Yet another Webassembly PDF renderer for node and the browser ☆177 Updated 4 months ago
- Microsoft Word doc/docx to PDF conversion on AWS Lambda using Node.js ☆48 Updated 2 years ago
- Emscripten port of Tesseract C++ API ☆159 Updated 2 months ago
- Extracts email address from an arbitrary text input. ☆62 Updated 4 months ago
- Building PDFium for Web Assembly ☆72 Updated 3 years ago
- a javascript docx parser ☆362 Updated 2 months ago
- Pure Javascript reader/writer for PowerPoint ☆130 Updated 9 years ago
- Get text content from any file ☆62 Updated 3 months ago
- Export a prosemirror document to a Microsoft Word file, using docx. ☆108 Updated 2 months ago
- Simple node package to convert a PDF into images. ☆180 Updated last month
- mention tool for editor.js ☆25 Updated 5 years ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS ☆48 Updated 7 months ago
- Undo history for ProseMirror ☆45 Updated 3 weeks ago
- Javascript library for creating annotations in PDF documents ☆553 Updated last year
- Module for formatting and transforming text as you type in Quill ☆68 Updated 5 years ago
- A rich-text editor using Prosemirror with React ☆38 Updated 10 months ago
- Asynchronous node.js wrapper for the Poppler PDF rendering library ☆186 Updated last week
- RFC 822 EML file format parser and builder ☆92 Updated last year
- 🤠 A library implementing different string similarity using JavaScript. ☆49 Updated 11 months ago
- Read data from a Word document using node.js ☆138 Updated 5 months ago
- Parser to convert PPTX to JSON format ☆86 Updated last year
- Yet another library to extract text from MS Office and PDF files ☆62 Updated 3 months ago
- Create a set of steps transforming one prosemirror json document to another ☆18 Updated 11 months ago
- Wrapper for PDF JS to add annotations ☆342 Updated 2 years ago