galkahana / pdf-text-extraction
CLI for extracting text from PDF files (and possibly tables)
☆74 Updated 3 months ago
Alternatives and similar repositories for pdf-text-extraction
Users that are interested in pdf-text-extraction are comparing it to the libraries listed below
- Building PDFium for Web Assembly ☆81 Updated 4 years ago
- The PDF library used by the Chromium project ☆435 Updated this week
- A C++ PDF manipulation library forked from PoDoFo ☆61 Updated 2 years ago
- VersyPDF is a high-quality, industry-strength PDF library for C/C++ programming languages meeting the requirements of the most demanding … ☆251 Updated 4 years ago
- A C/C++ MIME creation and parser library with support for S/MIME, PGP, and Unix mbox spools. ☆136 Updated this week
- Blazing fast library for fuzzy filtering, matching, and other fuzzy things! ☆27 Updated last week
- SQLite3 encryption extension with support for multiple ciphers ☆547 Updated 2 weeks ago
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo… ☆92 Updated last month
- A modified version of Chromium's base library to support building on gcc and msvc. ☆37 Updated last year
- compact_enc_det - Compact Encoding Detection ☆241 Updated last year
- PDFium Mirror, updated automatically ☆25 Updated this week
- Apache Xerces-C validating XML parser ☆150 Updated 6 months ago
- wxWidgets components to display PDF content with the PDFium library ☆99 Updated 4 years ago
- ☆439 Updated 11 years ago
- Port of QuickJS Javascript Engine. ☆309 Updated last year
- Fast fuzzy regex matcher: specify max edit distance to find approximate matches. FuzzyMatcher is now included in RE/flex. ☆37 Updated 4 months ago
- Compression and Encryption Virtual File System for SQLite 3. ☆118 Updated 4 years ago
- wxPdfDocument - Generation of PDF documents from wxWidgets applications ☆81 Updated 3 weeks ago
- A CUPS/PWG/Apple raster file viewer for Linux, macOS, and Windows ☆31 Updated last week
- PDFium Reader ☆75 Updated 2 years ago
- libchardet - Mozilla's Universal Charset Detector C/C++ API ☆112 Updated 4 years ago
- PoDoFo is a library to work with the PDF file format. The name comes from the first letter of PDF (Portable Document Format). A few tools… ☆51 Updated 11 years ago
- Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and… ☆67 Updated 11 months ago
- Cross platform C/C++ library with C#, Java, Python, Progress 4GL wrappers and command line tools for generating Microsoft Word .DOCX (Ope… ☆170 Updated 8 years ago
- Yet another what-you-see-is-what-you-get equation editor ☆99 Updated 3 years ago
- libzip Windows build with Visual Studio. ☆64 Updated 3 months ago
- This library provides a C++ interface to XML files. It uses libxml2 to access the XML files. ☆71 Updated this week
- C++ library that translates office documents to HTML ☆29 Updated this week
- Async C++ Cross-Platform library that modernizes libarchive using Qt. Simply extracts 7z, Tarballs and other supported formats by liba… ☆92 Updated last year
- SVG Native viewer is a library that parses and renders SVG Native documents ☆163 Updated this week