ScientaNL / pdf-extractor
Node.js module for rendering PDF pages to images, SVGs, HTML files, text files, and JSON metadata
☆99 · Updated 2 years ago
Alternatives and similar repositories for pdf-extractor
Users interested in pdf-extractor are comparing it to the libraries listed below.
- pdf2html is a module which helps to convert PDF files to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆170 · Updated last month
- Node.js lib for extracting data from PDF files ☆232 · Updated last year
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm ☆55 · Updated 6 years ago
- 📰 Yet another WebAssembly PDF renderer for Node and the browser ☆192 · Updated 11 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods. ☆199 · Updated 6 months ago
- Simple Node package to convert a PDF into images. ☆194 · Updated 7 months ago
- A wrapper for PDF Toolkit with streams and promises. ☆141 · Updated last year
- Annotation layer for pdf.js ☆284 · Updated 8 months ago
- A powerful PDF tool for NodeJS based on HummusJS. ☆346 · Updated 2 years ago
- A "1:1 output" JavaScript port of Potrace JS for NodeJS. ☆24 · Updated 3 months ago
- Get text content from any file ☆65 · Updated 9 months ago
- Pure JavaScript reader/writer for PowerPoint ☆144 · Updated 9 years ago
- NLP functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more. ☆130 · Updated last year
- Split {Japanese, English} text into sentences. ☆128 · Updated last year
- Node module wrapper for WordNet dictionary. ☆54 · Updated 3 years ago
- ☆187 · Updated 4 years ago
- Convert PDF files into images using Poppler with promises. It achieves 10x faster performance compared to other PDF converters. ☆57 · Updated 4 years ago
- Add image annotation to your web apps. ☆153 · Updated 2 months ago
- Fast full-text search based on BM25 ☆63 · Updated 2 years ago
- MongoDB adapter for Yjs ☆37 · Updated 2 years ago
- Node.js binding for fasttext representation and classification. ☆43 · Updated last year
- WebAssembly-based JavaScript bindings for Google Compact Language Detector v3 ☆68 · Updated last year
- Parser to convert PPTX to JSON format ☆89 · Updated 2 years ago
- An npm utility program to convert office documents (documents/excel/presentations) into PDF/HTML ☆37 · Updated 4 years ago
- An example Node.js app that integrates Keygen with Paddle for accepting payments. ☆35 · Updated 3 years ago
- Node.js - Convert DOCX to PDF, PNG to PDF, get thumbnails for PDF, stream PDFs. ☆81 · Updated 2 years ago
- JavaScript Node.js Excel formula parser ☆116 · Updated 8 months ago
- Artisanal inbound emails for every web app using Node.js ☆77 · Updated 2 years ago
- 📃 Node.js wrapper for pdftocairo - PDF to PNG/JPEG/TIFF/PDF/PS/EPS/SVG using cairo ☆26 · Updated 2 years ago
- RFC 822 EML file format parser and builder ☆92 · Updated 2 years ago