unoconv / unoserver
☆580Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for unoserver
- Docker files for a dockerized unoserver☆42Updated last week
- Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.☆2,615Updated last year
- Python bindings to PDFium☆419Updated last week
- Simplify DOCX files to JSON☆219Updated last month
- Javascript library for creating annotations in PDF documents☆546Updated last year
- Annotation layer for pdf.js☆265Updated last month
- Simple command line utility for converting .doc & .xls files to any supported format such as Text, RTF, CSV or PDF☆447Updated this week
- Demos, examples and utilities using PyMuPDF☆570Updated 4 months ago
- Train Tesseract LSTM with make☆637Updated 5 months ago
- Docx rendering library☆1,289Updated this week
- A command line tool to convert Microsoft Office documents to PDFs☆616Updated 7 months ago
- A python module that wraps the pdftoppm utility to convert PDF to PIL Image object☆1,634Updated 3 months ago
- 📰 Yet another Webassembly PDF renderer for node and the browser☆176Updated 4 months ago
- Python bindings for WPS Office RPC (for Linux)☆225Updated 3 weeks ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆369Updated 3 months ago
- PDF to XML ALTO file converter☆215Updated last month
- Convert html to docx☆73Updated 4 months ago
- Convenience Docker images for Apache Tika Server☆135Updated 3 weeks ago
- ☆23Updated 2 weeks ago
- Python binding to Poppler-cpp pdf library☆97Updated 2 months ago
- Convert file formats like docx, xlx to other formats like pdf, png - based on jodconverter and libreoffice☆76Updated 3 weeks ago
- PDF signing software written in Java. It supports visible signatures, timestamping, certificate verification and many other cool features☆320Updated last week
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆299Updated last year
- Wrapper for PDF JS to add annotations☆339Updated 2 years ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆154Updated 4 months ago
- PDF parser and converter to HTML☆83Updated last month
- Typescript wrapper for the PDFium library, works in browser and node.js☆42Updated 3 weeks ago
- pdfrw is a pure Python library that reads and writes PDFs☆1,869Updated 6 months ago
- OCR engine for all the languages☆743Updated last week