galkahana / pdf-text-extraction
CLI for extracting text from PDF files (and possibly tables)
☆77 Updated last month
Alternatives and similar repositories for pdf-text-extraction:
Users who are interested in pdf-text-extraction are comparing it to the libraries listed below.
- VersyPDF is a high-quality, industry-strength PDF library for C/C++ programming languages meeting the requirements of the most demanding … ☆240 Updated 3 years ago
- Building PDFium for WebAssembly ☆75 Updated 4 years ago
- A C++ PDF manipulation library forked from PoDoFo ☆59 Updated 2 years ago
- wxWidgets components to display PDF content with the PDFium library ☆95 Updated 4 years ago
- The PDF library used by the Chromium project ☆408 Updated this week
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo… ☆83 Updated last week
- Blazing fast library for fuzzy filtering, matching, and other fuzzy things! ☆27 Updated 3 weeks ago
- A C++17 PDF manipulation library ☆454 Updated this week
- PDFium Reader ☆67 Updated last year
- libchardet - Mozilla's Universal Charset Detector C/C++ API ☆112 Updated 3 years ago
- High performance library for creating, modifying and parsing PDF files in C++ ☆939 Updated 3 weeks ago
- C++ library that translates office documents to HTML ☆25 Updated 2 months ago
- Qt global shortcut (system-wide hotkey) class ☆31 Updated 11 years ago
- ☆163 Updated 10 years ago
- jbig2 decoder using code from pdfium ☆9 Updated 6 years ago
- Compile-time and runtime CSV parser written in C++17 ☆31 Updated this week
- uchardet is an encoding detector library, which takes a sequence of bytes in an unknown character encoding and attempts to determine the … ☆44 Updated 10 months ago
- A simple C++ header-only template library implementing matching using wildcards ☆87 Updated last year
- compact_enc_det - Compact Encoding Detection ☆228 Updated last year
- 📰 Yet another WebAssembly PDF renderer for node and the browser ☆190 Updated 9 months ago
- C++ wrapper for PCRE2 Library ☆70 Updated last year
- Unofficial mirror of the WebKit SVN repository ☆85 Updated last month
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows ☆62 Updated 9 years ago
- Skia mirror with a performant CMakeLists.txt. ☆27 Updated 4 years ago
- SVG Native viewer is a library that parses and renders SVG Native documents ☆159 Updated 6 months ago
- Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS,… ☆131 Updated 7 years ago
- Async C++ Cross-Platform library that modernizes libarchive using Qt. Simply extracts 7z, Tarballs and other supported formats by liba… ☆90 Updated last year
- Pure C++17 header only implementation of the Facebook Flux-like pattern ☆16 Updated 7 years ago
- wxPdfDocument - Generation of PDF documents from wxWidgets applications ☆77 Updated 2 months ago
- Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and… ☆62 Updated 6 months ago