ocrmypdf / OCRmyPDF-EasyOCR
OCRmyPDF EasyOCR plugin
☆62Updated 5 months ago
Alternatives and similar repositories for OCRmyPDF-EasyOCR:
Users that are interested in OCRmyPDF-EasyOCR are comparing it to the libraries listed below
- A post-processing tool for scanned sheets of paper.☆79Updated 11 months ago
- Building scantailor and its dependencies☆58Updated last year
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆146Updated last week
- ScriptReader allows you to perform Optical Character Recognition (OCR) on your handwritten notes!☆77Updated 5 months ago
- Web application that converts audio and video to text using AI, supporting various formats and self-hosting.☆76Updated last week
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆145Updated last year
- Document image dewarping library using a cubic sheet model☆142Updated this week
- A LibreOffice Writer extension that adds local-inference generative AI features.☆72Updated 3 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆89Updated this week
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated last year
- Benchmarking PDF libraries☆257Updated last year
- web based editor for subtitles and transcripts☆121Updated 6 months ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆65Updated last year
- Logical structure analysis for visually structured documents☆86Updated 2 years ago
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆280Updated last year
- Translate HTML using Argos Translate☆50Updated last year
- Library used to deskew a scanned document☆438Updated last week
- losslessly convert images to pdf☆65Updated 4 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆184Updated 2 months ago
- Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆22Updated last year
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆105Updated 4 years ago
- A repository of custom widgets to embed in Grist documents☆62Updated last month
- ez audio transcription tool with flexible processing and post-processing options☆144Updated last year
- Python library to extract tabular data from images and scanned PDFs☆271Updated 6 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆65Updated this week
- Local cross-platform machine translation GUI, based on CTranslate2☆91Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆42Updated last month
- Streamlit Web UI for OCRmyPDF☆41Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆382Updated 6 months ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆51Updated 3 months ago