A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
☆304May 25, 2025Updated 10 months ago
Alternatives and similar repositories for pdf2pdfocr
Users that are interested in pdf2pdfocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- `pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text!☆137Aug 2, 2023Updated 2 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Mar 31, 2025Updated last year
- Python script to do PDF OCR conversion using Tesseract☆375Jun 2, 2023Updated 2 years ago
- xState-based validation tool for OCF files☆15Apr 1, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆411Aug 10, 2024Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Jan 6, 2024Updated 2 years ago
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆33,120Updated this week
- Tool to OCR PDFs using Google Cloud Vision☆42Dec 7, 2022Updated 3 years ago
- Onchain cap table management with an offchain SEC transfer agent-compliant DB.☆16Apr 2, 2026Updated last week
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Client library for OpenOCR☆31Dec 3, 2014Updated 11 years ago
- Automatically exported from code.google.com/p/osis-converters☆13Apr 1, 2026Updated last week
- Configuration files for Unbound as a caching DNS server with DNSSEC validation and DNS over TLS forwarding.☆13Jan 13, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…☆33Nov 12, 2021Updated 4 years ago
- Extract palette from an image☆15Nov 20, 2022Updated 3 years ago
- A set of tools for rotating, cropping, and binding the images from a scanned book into a PDF.☆19Aug 15, 2018Updated 7 years ago
- A post-processing tool for scanned sheets of paper.☆1,167Jul 11, 2024Updated last year
- Your own personal assistant powered by Twilio☆15Feb 19, 2015Updated 11 years ago
- Technical Committee Documents☆16Mar 18, 2026Updated 3 weeks ago
- Easily work with .docx files from Clojure (a wrapper on Apache POI library).☆12Sep 4, 2019Updated 6 years ago
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- Hyperbox Client☆13Dec 27, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- OCR engine for all the languages☆976Updated this week
- Tools to process books in a cloud based pipeline system☆66Mar 30, 2026Updated last week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Apr 30, 2025Updated 11 months ago
- ☆14Nov 30, 2022Updated 3 years ago
- A mirror of https://git.tecosaur.net/tec/pdftotext.el☆12Jan 4, 2024Updated 2 years ago
- Code for several utilities for use with VIVO☆11Nov 15, 2012Updated 13 years ago
- Little api client for paperless(-ngx): pypaperless☆88Apr 1, 2026Updated last week
- A web app for transliterating Hebrew☆18Updated this week
- guides and test data for OCR4all☆32Oct 4, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- jwt rest api using realworld spec and google apps script☆14Jan 5, 2023Updated 3 years ago
- ☆14Dec 1, 2023Updated 2 years ago
- A text annotation plugin for Protege 5+☆18Mar 10, 2026Updated 3 weeks ago
- ☆16Feb 16, 2023Updated 3 years ago
- Scrape posts from Deadspin☆10Aug 23, 2021Updated 4 years ago
- A post-processing tool for scanned sheets of paper.☆85Mar 9, 2024Updated 2 years ago