danigm / poppler
Personal clone of Poppler, official repository is here: https://gitlab.freedesktop.org/poppler/poppler
☆125Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for poppler
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 3 months ago
- CMap Resources☆254Updated last year
- This is not the poppler repository. Please see https://poppler.freedesktop.org/☆52Updated 14 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆370Updated 3 months ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆215Updated 4 years ago
- Linguistic Annotation and Visualization Tool for PDF Documents☆200Updated 5 years ago
- A library for extracting tables from PDF files☆90Updated 11 years ago
- High performance library for creating, modiyfing and parsing PDF files in C++☆901Updated last month
- Wrapper for pdftohtml that tries to extract paragraph structure☆50Updated 5 years ago
- PoDoFo is a library to work with the PDF file format. The name comes from the first letter of PDF (Portable Document Format). A few tools…☆52Updated 10 years ago
- [DEPRECATED - please use i7j-rups instead] RUPS is an abbreviation for Reading and Updating PDF Syntax. RUPS is a tool built on top of iT…☆110Updated 6 years ago
- k2pdfopt library for koreader, based on http://willus.com/k2pdfopt☆93Updated last week
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.☆129Updated 6 years ago
- An extendable docx file format parser and converter☆190Updated 4 years ago
- Run pdf2htmlEX in a Docker container.☆24Updated 9 months ago
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆55Updated 9 years ago
- Extremely Naive Charset Analyser☆285Updated last month
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆45Updated 10 years ago
- ☆391Updated 10 years ago
- Python binding to libpoppler with focus on text extraction☆98Updated 2 years ago
- PDFEdit is a free PDF editor.☆85Updated 12 years ago
- ☆163Updated 10 years ago
- Cross platform C/C++ library with C#, Java, Python, Progress 4GL wrappers and command line tools for generating Microsoft Word .DOCX (Ope…☆163Updated 7 years ago
- Qt5 interface of the popular PDF library MuPDF☆113Updated 10 years ago
- cef-pdf HTML to PDF utility☆81Updated 2 years ago
- An open-source ODBC driver manager and SDK that facilitates the development of database-independent applications on linux, freebsd, unix …☆164Updated 6 months ago
- BOUML is a free UML 2 tool box allowing you to specify and generate code in C++, Java, Idl, Php and Python.☆36Updated 12 years ago
- 🧙♂️ ImageMagick 6☆199Updated this week
- A simple viewer and inspection tool for text boxes in PDF documents☆92Updated 2 years ago