LeoFCardoso / pdf2pdfocrView external linksLinks
A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
☆303May 25, 2025Updated 8 months ago
Alternatives and similar repositories for pdf2pdfocr
Users that are interested in pdf2pdfocr are comparing it to the libraries listed below
Sorting:
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Mar 31, 2025Updated 10 months ago
- ☆12Apr 13, 2024Updated last year
- Apertium linguistic data for English☆10Dec 31, 2025Updated last month
- Visual, page-by-page comparison of two PDF files☆21Apr 7, 2014Updated 11 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Jan 6, 2024Updated 2 years ago
- postcorrection web☆12Mar 6, 2023Updated 2 years ago
- Some of my tools for paperless-ngx, for example title generation☆11Jul 10, 2024Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- Utilities for the Ledger accounting system.☆12Jul 5, 2016Updated 9 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- xState-based validation tool for OCF files☆15Apr 10, 2025Updated 10 months ago
- Node JS app that will loop through a directory of images, ocr sections and use this text to rename the file☆11Apr 28, 2018Updated 7 years ago
- Extract palette from an image☆15Nov 20, 2022Updated 3 years ago
- Library for Object Linking and Embedding (OLE) data types☆12Nov 27, 2025Updated 2 months ago
- Master repository which includes most other OCR-D repositories as submodules☆72Jul 4, 2025Updated 7 months ago
- jwt rest api using realworld spec and google apps script☆14Jan 5, 2023Updated 3 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- FFMPEG/Python script to generate cover art videos for songs and albums☆13Jul 22, 2019Updated 6 years ago
- Learning Finite State Machine Models from Data with a Genetic Algorithm☆11Dec 1, 2025Updated 2 months ago
- Gui for users who use the coqui-TTS vits model.☆15Sep 16, 2022Updated 3 years ago
- Named Entity Recognition with the Nametag Maximum Entropy Markov model☆12Updated this week
- A post-processing tool for scanned sheets of paper.☆1,152Jul 11, 2024Updated last year
- Download client for legal opinions☆13Jan 26, 2025Updated last year
- Kubernetes Secret generation from secure credential repos☆69Jan 30, 2019Updated 7 years ago
- ☆14Nov 30, 2022Updated 3 years ago
- tiny .7z extractor and SFX, size-optimized for Linux i386☆15Nov 30, 2025Updated 2 months ago
- Themed, fully featured PDF viewer for the Atom editor☆12Jan 28, 2026Updated 2 weeks ago
- Utilities and applications for the FlatGov project by Demand Progress☆16Feb 8, 2023Updated 3 years ago
- Fast reading and writing of Msgpack data in R msgpack.org[R]☆14Mar 22, 2019Updated 6 years ago
- Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email☆17Dec 11, 2024Updated last year
- GUI for creating and editing regular expressions☆15Jan 31, 2026Updated 2 weeks ago
- A miniature version of the l4 language☆13Jun 29, 2025Updated 7 months ago
- ☆15Jun 16, 2021Updated 4 years ago
- Correction of spaces with character-based neural language models.☆13Aug 23, 2022Updated 3 years ago
- Reflectance Transformation Imaging☆15Aug 1, 2023Updated 2 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆199May 21, 2025Updated 8 months ago
- A small command-line tool to extract or restore the URL of every open tab in the current browser window on macOS.☆18Jan 19, 2026Updated 3 weeks ago
- Ever wanted to use custom discord emojis on other servers, without a nitro subscription? Well, with this script, YOU CAN without needing …☆25Jan 8, 2021Updated 5 years ago
- Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowd…☆14Oct 3, 2017Updated 8 years ago