py-pdf / sample-filesLinks
Files which can be used to test PDF readers
☆66Updated 2 months ago
Alternatives and similar repositories for sample-files
Users that are interested in sample-files are comparing it to the libraries listed below
Sorting:
- Document image dewarping library using a cubic sheet model☆197Updated last week
- An index of PDF-centric corpora☆161Updated 7 months ago
- ☆879Updated 2 months ago
- Python bindings to PDFium, reasonably cross-platform.☆719Updated last week
- Unicode to ASCII transliteration - C Elixir Go Java JS Julia PHP Python Ruby Rust Shell .NET☆362Updated 5 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆407Updated last year
- Easy to use PDF CLI tool powered by PDFium and go-pdfium☆34Updated 2 months ago
- Inspect how the PDF's structure looks.☆25Updated 2 years ago
- A simple python wrapper for PDFium.☆17Updated 4 years ago
- JBIG2 Encoder☆48Updated last month
- Read-only Mirror - no pull request☆19Updated last year
- An open source set of Java filters for creating, merging and validating XLIFF 1.2, 2.0, 2.1 and 2.2 files.☆79Updated 2 weeks ago
- JS/WebAssembly build of the Tesseract OCR engine for use in browsers and Node☆353Updated 2 months ago
- ☆19Updated 4 months ago
- Convert omml to latex for displaying in web browsers (KaTeX)☆36Updated 5 years ago
- Easy to use PDF library using Go and PDFium☆309Updated last week
- ☆20Updated last year
- OCR engine for all the languages☆940Updated this week
- Docker Image with latest Tesseract OCR Version 5.x.x built from sources☆48Updated last week
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆2,018Updated this week
- A collection of Java APIs for Xpdf - the open source library for operating on PDF files.☆15Updated 10 months ago
- Open XML SDK for Rust☆42Updated 6 months ago
- Render PDFs in Rust using libpoppler☆39Updated last year
- Library used to deskew a scanned document☆498Updated this week
- Train Tesseract LSTM with make☆711Updated 9 months ago
- Rust binding to mupdf☆176Updated last week
- The Caesium compression library written in Rust (with a C interface)☆189Updated this week
- A Rust wrapper around PDFium allowing you to render PDFs from Rust☆29Updated 4 years ago
- A CUPS/PWG/Apple raster file viewer for Linux, macOS, and Windows☆33Updated 4 months ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆752Updated last week