unoconv / unoserverLinks
☆852Updated 2 weeks ago
Alternatives and similar repositories for unoserver
Users that are interested in unoserver are comparing it to the libraries listed below
Sorting:
- Docker files for a dockerized unoserver☆75Updated last week
- Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.☆2,741Updated 2 years ago
- Python bindings to PDFium, reasonably cross-platform.☆689Updated this week
- Simplify DOCX files to JSON☆257Updated last year
- Annotation layer for pdf.js☆289Updated last week
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,916Updated last year
- A command line tool to convert Microsoft Office documents to PDFs☆655Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆329Updated 2 years ago
- Javascript library for creating annotations in PDF documents☆627Updated 2 years ago
- ☆584Updated 2 months ago
- Demos, examples and utilities using PyMuPDF☆690Updated last year
- Convert Word documents (.docx files) to HTML☆1,034Updated 3 weeks ago
- HTML to DOCX converter☆476Updated 7 months ago
- A Python tool to help extracting information from structured PDFs.☆425Updated this week
- Python bindings for WPS Office RPC (for Linux)☆275Updated 8 months ago
- Thin wrapper for "pandoc" (MIT)☆1,081Updated last week
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆198Updated 3 weeks ago
- A Python library for reading and writing PDF, powered by QPDF☆2,545Updated 2 weeks ago
- Convert html to docx☆84Updated last year
- A Python library to extract tabular data from PDFs☆66Updated 8 months ago
- A simple RESTful server for converting documents using unoconv☆23Updated 3 years ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆195Updated last week
- Wrapper for PDF JS to add annotations☆376Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆404Updated last year
- Simple PDF text extraction☆963Updated 10 months ago
- Convert omml to latex for displaying in web browsers (KaTeX)☆35Updated 5 years ago
- Append/Concatenate .docx documents☆123Updated last year
- Library used to deskew a scanned document☆495Updated last week
- A post-processing tool for scanned sheets of paper.☆1,130Updated last year
- A pure python based utility to extract text and images from docx files.☆573Updated 8 months ago