Convert a PDF via OCR to a TXT file in UTF-8 encoding
☆159Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for ocr2text
Users that are interested in ocr2text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆303May 25, 2025Updated 11 months ago
- Python library to extract tabular data from images and scanned PDFs☆286Jul 30, 2024Updated last year
- ☆10Mar 16, 2023Updated 3 years ago
- An expandable and scalable OCR pipeline☆90Nov 14, 2017Updated 8 years ago
- Use Cache URLs in your Django Application☆20Jan 24, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- jpdfbookmarks - fix JPdfBookmarks GUI mode open a pdf have bookmarks include CJK (Chinese , Japanese , Korean ) characters will show like…☆11Sep 4, 2023Updated 2 years ago
- OCR-D python tools☆33Aug 16, 2024Updated last year
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆47Mar 31, 2025Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Jan 6, 2024Updated 2 years ago
- Fast and lightweight persistent promise based JSON RPC 2.0 client implementation over TCP and Unix socket☆11Oct 27, 2015Updated 10 years ago
- HOCR Specification Python Parser☆12Sep 23, 2015Updated 10 years ago
- ☆10Apr 2, 2024Updated 2 years ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- Optical character recognition using neural network. Implemented with Python and its libraries Numpy and OpenCV.☆36Oct 1, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Chess Forge application☆23Updated this week
- `pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text!☆137Aug 2, 2023Updated 2 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition.☆277Jun 9, 2020Updated 5 years ago
- ☆11Jun 21, 2023Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆411Aug 10, 2024Updated last year
- A simple example application for providing a comment system that is hosted serverlessly in AWS.☆13May 1, 2019Updated 7 years ago
- Experimental library for connecting Arduino boards to Elasticsearch and Elastic Cloud☆13Feb 6, 2025Updated last year
- sci palettes for matplotlib/seaborn☆10Feb 14, 2022Updated 4 years ago
- Multiwriter documents over dat☆13May 11, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Garlmap is the Gapless Almighty Rule-based Logical Mpv Audio Player☆15Feb 27, 2026Updated 2 months ago
- Grepify the GUI Regex Text Scanner for Code Reviewers☆23Apr 15, 2013Updated 13 years ago
- ☆19Dec 4, 2021Updated 4 years ago
- Transliterate español (spanish) spelling to andaluz proposals using python☆27Apr 24, 2026Updated last week
- Document Layout Analysis☆403Updated this week
- One Big Text File (OBTF) Journal in Markdown☆22Jan 17, 2026Updated 3 months ago
- Racket interpreter in JavaScript/TypeScript☆10Apr 17, 2017Updated 9 years ago
- Tools, utils and helpers for JSON RPC 2.0 integration☆14Mar 8, 2023Updated 3 years ago
- 🏝 Simple sinon like sandbox for jest☆16Jun 2, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- BigSMILES☆10Jun 16, 2024Updated last year
- A Gtk/Qt front-end to tesseract-ocr.☆1,942Jan 15, 2026Updated 3 months ago
- Cross-platform Blob implementation for Node.js and the Web.☆12Oct 18, 2022Updated 3 years ago
- VenomPred 2.0 API☆11Feb 4, 2026Updated 3 months ago
- ☆12Aug 30, 2018Updated 7 years ago
- OCR engine for all the languages☆987Updated this week
- Qt builds for the Raspberry Pi platform☆11Nov 27, 2021Updated 4 years ago