Convert a PDF via OCR to a TXT file in UTF-8 encoding
☆160 · Oct 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for ocr2text
Users that are interested in ocr2text are comparing it to the libraries listed below.
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆303 · May 24, 2026 · Updated last month
- Python library to extract tabular data from images and scanned PDFs ☆286 · Jul 30, 2024 · Updated last year
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆73 · Nov 7, 2020 · Updated 5 years ago
- DFKI Layout Detection for OCR-D ☆47 · May 1, 2025 · Updated last year
- ☆10 · Mar 16, 2023 · Updated 3 years ago
- OCR-D python tools ☆33 · Aug 16, 2024 · Updated last year
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆47 · Mar 31, 2025 · Updated last year
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder ☆95 · Apr 14, 2026 · Updated 2 months ago
- ☆10 · Mar 21, 2020 · Updated 6 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR … ☆67 · Jan 6, 2024 · Updated 2 years ago
- ☆12 · Dec 19, 2019 · Updated 6 years ago
- HOCR Specification Python Parser ☆12 · Sep 23, 2015 · Updated 10 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆522 · Mar 3, 2021 · Updated 5 years ago
- Optical character recognition using neural network. Implemented with Python and its libraries Numpy and OpenCV. ☆36 · Oct 1, 2016 · Updated 9 years ago
- Chess Forge application ☆25 · Jun 26, 2026 · Updated last week
- `pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text! ☆137 · Aug 2, 2023 · Updated 2 years ago
- notmuch-labeler improves notmuch way of displaying labels through fonts, pictures, and hyperlinks. ☆16 · Jul 21, 2015 · Updated 10 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆415 · Aug 10, 2024 · Updated last year
- Generate periodic oscillation into an array/audiobuffer ☆27 · May 25, 2020 · Updated 6 years ago
- Building API and tools for EPO OPS patent data ☆10 · Mar 16, 2017 · Updated 9 years ago
- ☆19 · Dec 4, 2021 · Updated 4 years ago
- Grepify the GUI Regex Text Scanner for Code Reviewers ☆23 · Apr 15, 2013 · Updated 13 years ago
- Tools, utils and helpers for JSON RPC 2.0 integration ☆14 · Mar 8, 2023 · Updated 3 years ago
- BigSMILES ☆11 · Jun 16, 2024 · Updated 2 years ago
- VenomPred 2.0 API ☆11 · Feb 4, 2026 · Updated 5 months ago
- ☆12 · Aug 30, 2018 · Updated 7 years ago
- OCR engine for all the languages ☆1,022 · Jun 26, 2026 · Updated last week
- ☆14 · Sep 8, 2017 · Updated 8 years ago
- Python wrapper for OSRA. Supports R-Group logic and integration with ChemSchematicResolver ☆10 · Apr 4, 2020 · Updated 6 years ago
- ODK Validate is a Java application for confirming that a form is valid and compliant with the ODK XForms specification. Contribute and ma… ☆12 · Jan 8, 2026 · Updated 5 months ago
- Antigen-receptor Design Against Peptide-MHC Targets ☆21 · Jan 9, 2026 · Updated 5 months ago
- Web app for a practice of four Chinese tones recognition. ☆13 · Aug 29, 2023 · Updated 2 years ago
- Emoji Toolkit for iOS - from JoyPixels (formerly EmojiOne) ☆10 · Aug 21, 2023 · Updated 2 years ago
- A minimal implementation of GraphRAG, designed to quickly prototype whether you're able to get good sense-making out of a large dataset w… ☆48 · Feb 7, 2025 · Updated last year
- Search and evaluation extensions for python-chess ☆19 · Feb 16, 2019 · Updated 7 years ago
- ☆48 · Jan 27, 2026 · Updated 5 months ago
- Python projects ☆14 · Oct 31, 2022 · Updated 3 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim ☆40 · May 23, 2026 · Updated last month
- A CLI tool for converting chess FENs into images ☆11 · Jun 24, 2022 · Updated 4 years ago