ScientaNL / pdf-extractor
Node.js module for rendering PDF pages to images, SVGs, HTML files, text files, and JSON metadata
☆100 Updated 2 years ago
Alternatives and similar repositories for pdf-extractor
Users that are interested in pdf-extractor are comparing it to the libraries listed below
- nodejs lib for extracting data from PDF files ☆238 Updated last month
- 📰 Yet another Webassembly PDF renderer for node and the browser ☆201 Updated last year
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆191 Updated 3 weeks ago
- a javascript docx parser ☆385 Updated 6 months ago
- Annotation layer for pdf.js ☆288 Updated 11 months ago
- Get text content from any file ☆66 Updated last year
- Extracts email address from an arbitrary text input. ☆64 Updated 6 months ago
- Add image annotation to your web apps. ☆153 Updated last month
- Generates a printable paginated pdf from DOM node using HTML5 canvas and svg ☆148 Updated last year
- Asynchronous Node.js wrapper for the Poppler PDF rendering library ☆222 Updated last week
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm ☆57 Updated 6 years ago
- Generate PPTX files on the server-side with JavaScript. ☆179 Updated last year
- HTML5 Canvas implementation for NodeJS backed by Puppeteer ☆64 Updated 2 years ago
- Yet another library to extract text from MS Office and PDF files ☆81 Updated last year
- A tiny, highly-customizable, single-function javascript/typescript library that captures a webpage and returns a new lightweight, self-co… ☆237 Updated 11 months ago
- Parser to convert PPTX to JSON format ☆90 Updated 2 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more. ☆131 Updated last year
- Fast Full Text Search based on BM25 ☆64 Updated 2 years ago
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs) ☆275 Updated last month
- 🤠 A library implementing different string similarity using JavaScript. ☆56 Updated 4 months ago
- ☆292 Updated 2 weeks ago
- javascript nodejs excel formula parser ☆119 Updated 11 months ago
- A high-performance in-memory convertor to convert svg to png/jpeg images for Node. ☆165 Updated last year
- ☆191 Updated 4 years ago
- Simple node package to convert a PDF into images. ☆196 Updated 10 months ago
- HTML to DOCX converter ☆467 Updated 4 months ago
- Image annotation block for Airtable ☆46 Updated 4 years ago
- Machine learning based text classification in JavaScript using n-grams and cosine similarity ☆132 Updated last year
- WebAssembly based Javascript bindings for google Compact Language Detector v3 ☆72 Updated last year
- ☆53 Updated 2 years ago