writecrow/ocr2text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/writecrow/ocr2text)

writecrow / ocr2text

Convert a PDF via OCR to a TXT file in UTF-8 encoding

☆160

Alternatives and similar repositories for ocr2text

Users that are interested in ocr2text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LeoFCardoso / pdf2pdfocr
View on GitHub
A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
☆303May 24, 2026Updated last month
ExtractTable / ExtractTable-py
View on GitHub
Python library to extract tabular data from images and scanned PDFs
☆286Jul 30, 2024Updated last year
ckorzen / pdf-text-extraction-benchmark
View on GitHub
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …
☆73Nov 7, 2020Updated 5 years ago
OCR-D / ocrd_anybaseocr
View on GitHub
DFKI Layout Detection for OCR-D
☆47May 1, 2025Updated last year
seuretm / ocrd_typegroups_classifier
View on GitHub
☆10Mar 16, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cisocrgroup / ocrd_cis
View on GitHub
OCR-D python tools
☆33Aug 16, 2024Updated last year
mauvilsa / tesseract-recognize
View on GitHub
Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format
☆47Mar 31, 2025Updated last year
GNOME / ocrfeeder
View on GitHub
Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder
☆95Apr 14, 2026Updated 2 months ago
WoodenJin / OptimalControl-RL_abstract
View on GitHub
☆10Mar 21, 2020Updated 6 years ago
deajan / pmOCR
View on GitHub
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …
☆67Jan 6, 2024Updated 2 years ago
sparkfun / Serial_Controlled_Motor_Driver
View on GitHub
☆12Dec 19, 2019Updated 6 years ago
athento / hocr-parser
View on GitHub
HOCR Specification Python Parser
☆12Sep 23, 2015Updated 10 years ago
eihli / image-table-ocr
View on GitHub
Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.
☆522Mar 3, 2021Updated 5 years ago
aksth / ocr
View on GitHub
Optical character recognition using neural network. Implemented with Python and its libraries Numpy and OpenCV.
☆36Oct 1, 2016Updated 9 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
czbar / ChessForge
View on GitHub
Chess Forge application
☆25Jun 26, 2026Updated last week
ElectricRCAircraftGuy / PDF2SearchablePDF
View on GitHub
`pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text!
☆137Aug 2, 2023Updated 2 years ago
DamienCassou / notmuch-labeler
View on GitHub
notmuch-labeler improves notmuch way of displaying labels through fonts, pictures, and hyperlinks.
☆16Jul 21, 2015Updated 10 years ago
ocropus / hocr-tools
View on GitHub
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
☆415Aug 10, 2024Updated last year
audiojs / audio-oscillator
View on GitHub
Generate periodic oscillation into an array/audiobuffer
☆27May 25, 2020Updated 6 years ago
fros1y / epo-download
View on GitHub
Building API and tools for EPO OPS patent data
☆10Mar 16, 2017Updated 9 years ago
brandoncc / telescope-harpoon.nvim
View on GitHub
☆19Dec 4, 2021Updated 4 years ago
nccgroup / grepify
View on GitHub
Grepify the GUI Regex Text Scanner for Code Reviewers
☆23Apr 15, 2013Updated 13 years ago
qiwi / json-rpc
View on GitHub
Tools, utils and helpers for JSON RPC 2.0 integration
☆14Mar 8, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
dylanwal / BigSMILES
View on GitHub
BigSMILES
☆11Jun 16, 2024Updated 2 years ago
MMVSL / VenomPred2.0
View on GitHub
VenomPred 2.0 API
☆11Feb 4, 2026Updated 5 months ago
cytoscape / cyjs-sample
View on GitHub
☆12Aug 30, 2018Updated 7 years ago
mittagessen / kraken
View on GitHub
OCR engine for all the languages
☆1,022Jun 26, 2026Updated last week
dunkarooftop / thought
View on GitHub
☆14Sep 8, 2017Updated 8 years ago
edbeard / pyosra
View on GitHub
Python wrapper for OSRA. Supports R-Group logic and integration with ChemSchematicResolver
☆10Apr 4, 2020Updated 6 years ago
getodk / validate
View on GitHub
ODK Validate is a Java application for confirming that a form is valid and compliant with the ODK XForms specification. Contribute and ma…
☆12Jan 8, 2026Updated 5 months ago
phbradley / ADAPT
View on GitHub
Antigen-receptor Design Against Peptide-MHC Targets
☆21Jan 9, 2026Updated 5 months ago
met / chinese-tones-practice
View on GitHub
Web app for a practice of four Chinese tones recognition.
☆13Aug 29, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
joypixels / emoji-toolkit-ios
View on GitHub
Emoji Toolkit for iOS - from JoyPixels (formerly EmojiOne)
☆10Aug 21, 2023Updated 2 years ago
khoj-ai / knowledge-graph
View on GitHub
A minimal implementation of GraphRAG, designed to quickly prototype whether you're able to get good sense-making out of a large dataset w…
☆48Feb 7, 2025Updated last year
Mk-Chan / python-chess-engine-extensions
View on GitHub
Search and evaluation extensions for python-chess
☆19Feb 16, 2019Updated 7 years ago
ntranoslab / vesm
View on GitHub
☆48Jan 27, 2026Updated 5 months ago
AFei19911012 / PythonSamples
View on GitHub
Python projects
☆14Oct 31, 2022Updated 3 years ago
UB-Mannheim / escriptorium
View on GitHub
Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim
☆40May 23, 2026Updated last month
tikul / fen-to-png
View on GitHub
A CLI tool for converting chess FENs into images
☆11Jun 24, 2022Updated 4 years ago