ScientaNL / pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
☆88Updated last year
Related projects: ⓘ
- nodejs lib for extracting data from PDF files☆205Updated 4 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆129Updated last month
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆153Updated 3 months ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆171Updated 2 months ago
- Asynchronous node.js wrapper for the Poppler PDF rendering library☆176Updated 2 weeks ago
- Get text content from any file☆61Updated last month
- Just the expression parser of mathjs☆54Updated 3 years ago
- A wrapper for PDF Toolkit with streams and promises.☆139Updated 5 months ago
- Provides an interface to convert PDF's pages to png files in Node.js by using ImageMagick☆237Updated 4 years ago
- Microsoft Word doc/docx to PDF conversion, client-side in-browser, using docx-wasm☆48Updated 5 years ago
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆55Updated 8 months ago
- ☆87Updated 2 years ago
- Simple node package to convert a PDF into images.☆166Updated last month
- Pure Javascript reader/writer for PowerPoint☆128Updated 8 years ago
- pdf2table is a node.js library that attempts to extract tables from a pdf.☆32Updated 4 months ago
- Parser to convert PPTX to JSON format☆83Updated last year
- ☆251Updated 2 weeks ago
- Read data from a Word document using node.js☆135Updated 3 months ago
- Multilingual tokenizer that automatically tags each token with its type☆59Updated last year
- 📃📸 Converts PDFs to images in nodejs☆72Updated last month
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs)☆224Updated last week
- Extracts email address from an arbitrary text input.☆62Updated 2 months ago
- A powerful PDF tool for NodeJS based on HummusJS.☆339Updated last year
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆120Updated 6 months ago
- Light multi-platform disk space checker without third-party for Node.js☆101Updated 9 months ago
- a javascript docx parser☆355Updated last week
- Mongodb adapter for Yjs☆37Updated last year
- Building PDFium for Web Assembly☆70Updated 3 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆23Updated last year
- 🤠A library implementing different string similarity using JavaScript.☆47Updated 9 months ago