ScientaNL / pdf-extractorLinks
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
☆106Updated 2 years ago
Alternatives and similar repositories for pdf-extractor
Users that are interested in pdf-extractor are comparing it to the libraries listed below
Sorting:
- Get text content from any file☆64Updated last year
- Extracts email address from an arbitrary text input.☆64Updated 11 months ago
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm☆58Updated 6 years ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆200Updated last month
- 📰 Yet another Webassembly PDF renderer for node and the browser☆212Updated last year
- Generate PPTX files on the server-side with JavaScript.☆187Updated last month
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆133Updated last year
- ☆194Updated 4 years ago
- a javascript docx parser☆399Updated 11 months ago
- NPM package for creating a keyword array from a string and excluding stop words.☆200Updated last year
- Generates a printable paginated pdf from DOM node using HTML5 canvas and svg☆149Updated last year
- A high-performance in-memory convertor to convert svg to png/jpeg images for Node.☆167Updated 2 years ago
- A tiny, highly-customizable, single-function javascript/typescript library that captures a webpage and returns a new lightweight, self-co…☆241Updated last year
- Fast Full Text Search based on BM25☆69Updated 3 years ago
- Yet another library to extract text from MS Office and PDF files☆84Updated 2 weeks ago
- A wrapper for PDF Toolkit with streams and promises.☆143Updated last year
- Node module wrapper for WordNet dictionary.☆53Updated 3 years ago
- Read data from a Word document using node.js☆148Updated last year
- Add image annotation to your web apps.☆152Updated 3 months ago
- An Express middleware for quick'n'easy server-sent events.☆126Updated 2 weeks ago
- Grammarify is a npm package that safely cleans up text that has mispellings, improper capitalization, lexical illusions, among other thin…☆73Updated 3 years ago
- Pure Javascript reader/writer for PowerPoint☆151Updated 10 years ago
- RFC 822 EML file format parser and builder☆96Updated 2 years ago
- html2screen is mean to use HTML+CSS+JS as motion design tools.☆74Updated 6 years ago
- Unified access to cloud storage services through a simple web API.☆151Updated 3 years ago
- Machine learning based text classification in JavaScript using n-grams and cosine similarity☆133Updated last year
- mention tool for editor.js☆26Updated 6 years ago
- HTML5 Canvas implementation for NodeJS backed by Puppeteer☆65Updated 2 years ago
- ☆53Updated 3 years ago
- Get n-grams from text☆84Updated 3 years ago