ScientaNL / pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
☆97 Updated last year
Alternatives and similar repositories for pdf-extractor:
Users who are interested in pdf-extractor are comparing it to the libraries listed below.
- nodejs lib for extracting data from PDF files ☆223 Updated 10 months ago
- 📰 Yet another Webassembly PDF renderer for node and the browser ☆188 Updated 8 months ago
- Get text content from any file ☆63 Updated 6 months ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆163 Updated 2 weeks ago
- Node.js - Convert DOCX to PDF, PNG to PDF, get thumbnails for PDF, stream PDFs. ☆79 Updated 2 years ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods. ☆172 Updated 3 months ago
- Simple node package to convert a PDF into images. ☆187 Updated 4 months ago
- Annotation layer for pdf.js ☆277 Updated 5 months ago
- Read data from a Word document using node.js ☆141 Updated 8 months ago
- Pure Javascript reader/writer for PowerPoint ☆139 Updated 9 years ago
- ☆272 Updated this week
- 🔀 Replace {{ variables }} in all your files ☆38 Updated last year
- A wrapper for PDF Toolkit with streams and promises. ☆141 Updated 10 months ago
- Language agnostic named entity recognizer ☆39 Updated 2 years ago
- ☆186 Updated 3 years ago
- Interactive PPTX slide viewer ☆37 Updated 6 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more. ☆125 Updated last year
- Asynchronous node.js wrapper for the Poppler PDF rendering library ☆208 Updated this week
- OneDrive API module for Node.js ☆111 Updated last year
- Parser to convert PPTX to JSON format ☆88 Updated 2 years ago
- Muhammara, a Node module with C/C++ bindings for modifying PDFs with JS in Node or Electron (based on, and a replacement for, galkhana/hummusjs) ☆249 Updated 2 months ago
- Yet another library to extract text from MS Office and PDF files ☆71 Updated 7 months ago
- Convert PDF files into images using Poppler with promises. It achieves 10x faster performance compared to other PDF converters. ☆53 Updated 3 years ago
- PDF.js-based PDF files viewer with annotation support ☆84 Updated 7 months ago
- A simple JS/TS client for interacting with a Gotenberg API ☆113 Updated last year
- Ghostscript4JS binds the Ghostscript C API to the Node.JS world. ☆70 Updated 8 months ago
- Module to export Word, Excel & PowerPoint to PDF. Requires Windows and an installed copy of Office 2013 ☆47 Updated 3 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more. ☆23 Updated last year
- a javascript docx parser ☆371 Updated 2 weeks ago
- Generate PPTX files on the server-side with JavaScript. ☆168 Updated last year