harshankur / officeParserLinks
A robust, strictly-typed Node.js and Browser library for parsing office files (docx, pptx, xlsx, odt, odp, ods, pdf, rtf). It produces a clean, hierarchical Abstract Syntax Tree (AST) with rich metadata, text formatting, and full attachment support.
☆276Updated 3 weeks ago
Alternatives and similar repositories for officeParser
Users that are interested in officeParser are comparing it to the libraries listed below
Sorting:
- Yet another library to extract text from MS Office and PDF files☆85Updated last month
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆201Updated 3 weeks ago
- ☆305Updated last week
- Generate PPTX files on the server-side with JavaScript.☆188Updated 2 months ago
- ☆157Updated 2 years ago
- Library Convert PDF to PNG☆168Updated last month
- Typescript wrapper for the PDFium library, works in browser and node.js☆157Updated 3 weeks ago
- Create PowerPoint presentations with React☆204Updated 11 months ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆212Updated last year
- Export a prosemirror document to a Microsoft Word file, using docx.☆158Updated 7 months ago
- Fast HTML to markdown converter for NodeJS or the browser☆249Updated 2 months ago
- Parse incomplete json text in best-effort manner☆272Updated 6 months ago
- Muhammara a node module with c/cpp bindings to modify PDF with js for node or electron (based/replacement on/of galkhana/hummusjs)☆296Updated this week
- 📃📸 Converts PDFs to images in nodejs☆136Updated 5 months ago
- Get text content from any file☆64Updated last year
- HTML to DOCX converter☆476Updated 9 months ago
- Parse partial JSON generated by LLM☆212Updated 6 months ago
- Generate vector embeddings in NodeJS☆172Updated last month
- Simple node package to convert a PDF into images.☆200Updated last year
- Example of drag-n-drop snippets in Tiptap. See demo-video for more info!☆110Updated 3 years ago
- Streaming, source-agnostic EventSource/Server-Sent Events parser☆450Updated last month
- Fast Node.js library to convert raster images to svg☆142Updated last year
- React component for ONLYOFFICE Document Server☆55Updated 2 weeks ago
- Dead simple pdf text reader☆44Updated 3 weeks ago
- nodejs lib for extracting data from PDF files☆246Updated 6 months ago
- Node.js bindings for faiss☆138Updated 2 years ago
- Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)☆296Updated last year
- Column extension for tiptap v2☆111Updated 2 years ago
- ☆153Updated 11 months ago
- A diff tool for JavaScript written in TypeScript.☆179Updated last week