abhi18av / awesome-pdf
List of tools for dealing with the wonderful PDF format.
☆49Updated 4 years ago
Alternatives and similar repositories for awesome-pdf:
Users that are interested in awesome-pdf are comparing it to the libraries listed below
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated this week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 3 weeks ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 7 months ago
- ☆36Updated 6 years ago
- Property Graph Exchange Format (PG) converter☆24Updated last month
- Awesome links related to RSS, ATOM, and Syndication formats.☆56Updated 8 months ago
- Installer for Thymeflow, a personal knowledge management system.☆33Updated 6 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆12Updated 4 years ago
- Yet Another Solr Admin☆47Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆14Updated 11 months ago
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- All the code from the book “Building User-Friendly DSLs” by Meinte Boersma for Manning Publications: https://www.manning.com/books/buildi…☆10Updated 6 months ago
- a collaborative scholarly text editor allowing to build static websites☆67Updated 3 years ago
- Easily display Zotero items on a webpage☆32Updated 2 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆97Updated 2 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated last year
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆23Updated 10 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- Awesome Diagram Tools☆63Updated 2 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆65Updated last year
- PDF to XML ALTO file converter☆234Updated this week
- Build Neo4j graphs from Datashare projects☆12Updated last month
- Hypertext-infused personal research productivity/database software (Mac/Win/Linux)☆147Updated this week
- JS for overlaying OCR on image using HOCR formatted HTML☆26Updated 8 years ago
- Tools to process books in a cloud based pipeline system☆58Updated 2 weeks ago
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆68Updated 10 months ago
- Free services, tools, articles and other resources for remote workers and distance learners☆51Updated 3 years ago
- The source code for self-managed Grist Enterprise.☆12Updated this week
- XSLT Documentation and examples☆19Updated last year
- Encapsulate dom-anchor-text-quote and dom-anchor-text-position for use in browser scripts☆10Updated 3 years ago