A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
☆303May 25, 2025Updated last year
Alternatives and similar repositories for pdf2pdfocr
Users that are interested in pdf2pdfocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- `pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text!☆137Aug 2, 2023Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆47Mar 31, 2025Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Jan 6, 2024Updated 2 years ago
- A simple library for segmenting legal texts☆18Apr 22, 2023Updated 3 years ago
- Tool to OCR PDFs using Google Cloud Vision☆42Dec 7, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dead Sea Scrolls in TF format based on Abegg's data☆29Apr 22, 2026Updated last month
- Onchain cap table management with an offchain SEC transfer agent-compliant DB.☆16May 2, 2026Updated 3 weeks ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Turn go/links into clickable elements in Obsidian☆13Apr 28, 2025Updated last year
- Utilities for the Ledger accounting system.☆12Jul 5, 2016Updated 9 years ago
- Configuration files for Unbound as a caching DNS server with DNSSEC validation and DNS over TLS forwarding.☆13Jan 13, 2019Updated 7 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆202May 21, 2025Updated last year
- A post-processing tool for scanned sheets of paper.☆1,179Jul 11, 2024Updated last year
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OCR engine for all the languages☆996May 7, 2026Updated 2 weeks ago
- personal diary☆14Apr 28, 2026Updated 3 weeks ago
- Code to analyse books and newspapers data using Apache Spark.☆16Feb 11, 2022Updated 4 years ago
- framework and tools for statically-generated and dynamic online reading environments☆14Jun 12, 2017Updated 8 years ago
- Tools to process books in a cloud based pipeline system☆65Apr 16, 2026Updated last month
- ☆14Nov 30, 2022Updated 3 years ago
- A web app for transliterating Hebrew☆18May 14, 2026Updated last week
- jwt rest api using realworld spec and google apps script☆14Jan 5, 2023Updated 3 years ago
- OCR-D python tools☆33Aug 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Phonotate.App is a local, open-source Electron app built with React designed to simplify creating training data for StyleTTS 2 and voice …☆11Jan 17, 2025Updated last year
- A text annotation plugin for Protege 5+☆18Mar 10, 2026Updated 2 months ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- Project Documentation☆12Jan 21, 2016Updated 10 years ago
- A miniature version of the l4 language☆13Jun 29, 2025Updated 10 months ago
- ☆13Dec 8, 2022Updated 3 years ago
- ☆260Updated this week
- Use the Google Cloud Speech API to transcribe audio files from a podcast.☆20May 17, 2017Updated 9 years ago
- Node JS app that will loop through a directory of images, ocr sections and use this text to rename the file☆11Apr 28, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Protégé Desktop plugin that provides a graphical representation of the class hierarchy in an OWL ontology.☆43Jun 18, 2024Updated last year
- ☆13Apr 13, 2024Updated 2 years ago
- Scripture Burrito Schema & Docs 🌯☆25Feb 22, 2026Updated 3 months ago
- Index and Search Your Private PDF Collection☆18Jan 16, 2016Updated 10 years ago
- FFMPEG/Python script to generate cover art videos for songs and albums☆14Jul 22, 2019Updated 6 years ago
- A Betty Blocks Component Set based on Material UI☆25Updated this week
- Correction of spaces with character-based neural language models.☆13Aug 23, 2022Updated 3 years ago