Convert a PDF via OCR to a TXT file in UTF-8 encoding
☆159 · Oct 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for ocr2text
Users that are interested in ocr2text are comparing it to the libraries listed below.
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI ☆43 · Apr 3, 2024 · Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! ☆303 · May 25, 2025 · Updated last year
- Python library to extract tabular data from images and scanned PDFs ☆286 · Jul 30, 2024 · Updated last year
- A dashboard to visualize earthquakes around the world between 2000 and 2020. ☆16 · Apr 25, 2024 · Updated 2 years ago
- DFKI Layout Detection for OCR-D ☆47 · May 1, 2025 · Updated last year
- A mirror of https://git.tecosaur.net/tec/pdftotext.el ☆12 · Jan 4, 2024 · Updated 2 years ago
- ☆10 · Mar 16, 2023 · Updated 3 years ago
- Make UFW docker-compatible with a single command ☆31 · Nov 30, 2025 · Updated 5 months ago
- Scrape posts from Deadspin ☆10 · Aug 23, 2021 · Updated 4 years ago
- jpdfbookmarks - fix JPdfBookmarks GUI mode open a pdf have bookmarks include CJK (Chinese, Japanese, Korean) characters will show like… ☆11 · Sep 4, 2023 · Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆47 · Mar 31, 2025 · Updated last year
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder ☆94 · Apr 14, 2026 · Updated last month
- Next generation OCR engine based on LSTMs. ☆51 · Apr 8, 2018 · Updated 8 years ago
- A Python helper library to convert between ISO 639 two- and three-letter codes. ☆11 · Nov 13, 2024 · Updated last year
- Chess Forge application ☆24 · Updated this week
- `pdf2searchablepdf input.pdf` = voila! "input_searchable.pdf" is created & now has searchable text! ☆137 · Aug 2, 2023 · Updated 2 years ago
- Vim editing support for kmonad config files ☆38 · Mar 20, 2022 · Updated 4 years ago
- Extract tables from scanned image PDFs using Optical Character Recognition. ☆277 · Jun 9, 2020 · Updated 5 years ago
- Editable Tree View in React ☆16 · Apr 14, 2019 · Updated 7 years ago
- ☆15 · Jul 7, 2024 · Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆411 · Aug 10, 2024 · Updated last year
- Building API and tools for EPO OPS patent data ☆10 · Mar 16, 2017 · Updated 9 years ago
- Grepify the GUI Regex Text Scanner for Code Reviewers ☆23 · Apr 15, 2013 · Updated 13 years ago
- ☆19 · Dec 4, 2021 · Updated 4 years ago
- BigSMILES ☆11 · Jun 16, 2024 · Updated last year
- A Gtk/Qt front-end to tesseract-ocr. ☆1,952 · Jan 15, 2026 · Updated 4 months ago
- VenomPred 2.0 API ☆11 · Feb 4, 2026 · Updated 3 months ago
- Python code and jupyter notebooks to accompany the manuscript "Deep learning models for lipid-nanoparticle-based drug delivery" ☆14 · Jul 28, 2020 · Updated 5 years ago
- ☆12 · Aug 30, 2018 · Updated 7 years ago
- OCR engine for all the languages ☆996 · May 7, 2026 · Updated 2 weeks ago
- Antigen-receptor Design Against Peptide-MHC Targets ☆21 · Jan 9, 2026 · Updated 4 months ago
- LocalAI website, powered by Hugo ☆15 · Nov 22, 2023 · Updated 2 years ago
- Visual SPARQL query tool ☆10 · Feb 26, 2016 · Updated 10 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim ☆38 · Updated this week
- Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also incl… ☆267 · Apr 11, 2015 · Updated 11 years ago
- Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Docum… ☆324 · Mar 25, 2023 · Updated 3 years ago
- A deep-learning-based multiple toolkits (DeTool) approach that uses the inputs of enzymes and substrates for biocatalystic tasks. ☆13 · Nov 24, 2023 · Updated 2 years ago
- Select, yank, paste, delete, or other operation of phrase. ☆25 · Apr 17, 2014 · Updated 12 years ago
- Didactic Web crawler for Web Search Engines (CS 6913) course at NYU ☆10 · Dec 8, 2022 · Updated 3 years ago