unoconv / unoserverLinks
☆805Updated 2 months ago
Alternatives and similar repositories for unoserver
Users that are interested in unoserver are comparing it to the libraries listed below
Sorting:
- Docker files for a dockerized unoserver☆66Updated last month
- Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.☆2,726Updated 2 years ago
- Python bindings to PDFium, reasonably cross-platform.☆638Updated this week
- Convert file formats like docx, xlx to other formats like pdf, png - based on jodconverter and libreoffice☆92Updated 4 months ago
- Demos, examples and utilities using PyMuPDF☆681Updated last year
- Simplify DOCX files to JSON☆250Updated 11 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,861Updated last year
- Convert Word documents (.docx files) to HTML☆989Updated this week
- Annotation layer for pdf.js☆288Updated 11 months ago
- Convenience Docker images for Apache Tika Server☆208Updated this week
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated last year
- Javascript library for creating annotations in PDF documents☆614Updated 2 years ago
- Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF☆486Updated 2 months ago
- HTML to DOCX converter☆473Updated 5 months ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆193Updated last month
- pyHanko: sign and stamp PDF files☆629Updated last week
- A Python library for reading and writing PDF, powered by QPDF☆2,469Updated last week
- Simple PDF text extraction☆948Updated 7 months ago
- A command line tool to convert Microsoft Office documents to PDFs☆649Updated last year
- A utility to read and write PDFs with Python☆337Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆396Updated last year
- Convert html to docx☆82Updated last year
- A simple python wrapper for PDFium.☆17Updated 3 years ago
- RUPS is an acronym for Reading and Updating PDF Syntax. RUPS is a tool built on top of iText® that allows you to look inside a PDF docume…☆328Updated this week
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆191Updated last week
- Wrapper for PDF JS to add annotations☆370Updated 3 years ago
- Extract structured text from pdfs quickly☆597Updated 3 months ago
- Benchmarking PDF libraries☆311Updated 2 months ago
- An open source set of Java filters for creating, merging and validating XLIFF 1.2, 2.0, 2.1 and 2.2 files.☆75Updated this week
- Thin wrapper for "pandoc" (MIT)☆1,045Updated 2 weeks ago