unoconv / unoserver
☆813 Updated 2 weeks ago
Alternatives and similar repositories for unoserver
Users who are interested in unoserver are comparing it to the libraries listed below.
- Docker files for a dockerized unoserver ☆74 Updated 2 weeks ago
- Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice. ☆2,726 Updated 2 years ago
- Python bindings to PDFium, reasonably cross-platform. ☆652 Updated this week
- Convert Word documents (.docx files) to HTML ☆1,007 Updated 3 weeks ago
- Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF ☆489 Updated 3 weeks ago
- Annotation layer for pdf.js ☆288 Updated last year
- Convenience Docker images for Apache Tika Server ☆209 Updated 3 weeks ago
- JavaScript library for creating annotations in PDF documents ☆616 Updated 2 years ago
- Convert file formats like docx, xlx to other formats like pdf, png - based on jodconverter and libreoffice ☆93 Updated 3 weeks ago
- A command line tool to convert Microsoft Office documents to PDFs ☆648 Updated last year
- Simplify DOCX files to JSON ☆253 Updated last year
- A Python module that wraps the pdftoppm utility to convert a PDF to PIL Image objects ☆1,873 Updated last year
- HTML to DOCX converter ☆473 Updated 5 months ago
- Demos, examples and utilities using PyMuPDF ☆684 Updated last year
- A Python tool to help extracting information from structured PDFs. ☆415 Updated this week
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆195 Updated 2 months ago
- A Python library for reading and writing PDF, powered by QPDF ☆2,488 Updated 3 weeks ago
- Convert HTML to DOCX ☆83 Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆327 Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆397 Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆192 Updated last week
- Display paginated content in the browser and generate print books using web technology ☆1,055 Updated 3 weeks ago
- ☆579 Updated last month
- Wrapper for PDF JS to add annotations ☆372 Updated 3 years ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆792 Updated last month
- Python binding to the Poppler-cpp PDF library ☆111 Updated last year
- PGroonga is a PostgreSQL extension to use Groonga as index. PGroonga makes PostgreSQL a fast full-text search platform for all languages! ☆659 Updated this week
- 📰 Binary distribution of PDFium ☆1,160 Updated this week
- Library used to deskew a scanned document ☆488 Updated last week
- A LibreOffice server wrapper that is exposed over HTTP to allow easy conversions from supported documents to PDF. ☆64 Updated 3 months ago