OCR4all / OCR4all
Provides OCR (Optical Character Recognition) services through web applications
608 stars, updated last year
Alternatives and similar repositories for OCR4all:
Users who are interested in OCR4all are comparing it to the repositories listed below:
- A fully static distributed library system powered by IPFS, SQLite and GitHub (513 stars, updated last month)
- Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with LLM. (527 stars, updated 3 weeks ago)
- Open-source platform for extracting structured data from documents using AI. (1,252 stars, updated this week)
- A Python library to inspect and modify the internal structure of a PDF file (963 stars, updated this week)
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. (184 stars, updated 2 months ago)
- Visualise your CSV files in seconds without sending your data anywhere (493 stars, updated last month)
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents. (146 stars, updated last week)
- Document Layout Analysis (359 stars, updated last month)
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… (828 stars, updated 4 months ago)
- OCR engine for all the languages (788 stars, updated this week)
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit… (479 stars, updated 4 months ago)
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is… (301 stars, updated 7 months ago)
- Transform JSON objects using vector embeddings (417 stars, updated 7 months ago)
- Uses an llm to generate ffmpeg commands (449 stars, updated last month)
- Collection of OCR-related python tools and wrappers from @OCR-D (126 stars, updated this week)
- A deep learning toolkit specialized for handwritten document analysis (221 stars, updated 5 months ago)
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust (1,601 stars, updated this week)
- Handwriting synthesis with Harfbuzz WASM. (457 stars, updated 5 months ago)
- Document image dewarping library using a cubic sheet model (142 stars, updated this week)
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections. (240 stars, updated last week)
- guides and test data for OCR4all (30 stars, updated 2 years ago)
- Master repository which includes most other OCR-D repositories as submodules (72 stars, updated last week)
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. (382 stars, updated 6 months ago)
- PDF text extraction pipeline: self-hosted, local-first, Docker-based (311 stars, updated last year)
- Create SEPA, FedNow, SWIFT, RTP payment initiations and process bank statements. (341 stars, updated last week)
- (279 stars, updated 2 months ago)
- 3D animated bookshelf for ebooks (403 stars, updated 6 months ago)
- Free and source-available Apache 2.0 licensed lightweight workflow automation tool. (228 stars, updated this week)