OCR4all / OCR4all
Provides OCR (Optical Character Recognition) services through web applications
☆679 Updated last year
Alternatives and similar repositories for OCR4all
Users who are interested in OCR4all are comparing it to the repositories listed below
- A Python library to inspect and modify the internal structure of a PDF file ☆994 Updated 3 weeks ago
- Visualise your CSV files in seconds without sending your data anywhere ☆510 Updated last week
- Document Layout Analysis ☆376 Updated 2 weeks ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆187 Updated last month
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit… ☆477 Updated 2 weeks ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM. ☆564 Updated 2 weeks ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆653 Updated last month
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆394 Updated 10 months ago
- OCR engine for all the languages ☆841 Updated last week
- CleverBee - The Open Source Deep Researcher Tool ☆300 Updated 2 weeks ago
- Examples and guides for using the VLM Run API ☆279 Updated 3 weeks ago
- Open-source platform for extracting structured data from documents using AI. ☆1,330 Updated last month
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections. ☆255 Updated 4 months ago
- A hub for various industry-specific schemas to be used with VLMs. ☆518 Updated last month
- This is a python implementation for stitching images. ☆232 Updated 8 months ago
- A fully static distributed library system powered by IPFS, SQLite and GitHub ☆523 Updated 5 months ago
- Fully neural approach for text chunking ☆360 Updated 2 months ago
- Your filesystem as a vector database ☆377 Updated 2 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆128 Updated last week
- Algolia alternative for technical docs ☆540 Updated 7 months ago
- Browser-LLM Auto-Scaling Technology ☆526 Updated this week
- Master repository which includes most other OCR-D repositories as submodules ☆73 Updated this week
- ☆279 Updated 2 weeks ago
- OCR Benchmark ☆511 Updated last month
- A curated list of awesome projects to simplify and improve paper and document scanning. ☆444 Updated 2 weeks ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆220 Updated 6 months ago
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust ☆1,696 Updated last month
- Weave your codebase into a single, navigable Markdown document ☆427 Updated last month
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is… ☆301 Updated last year
- Cutting-edge web scraping techniques workshop at NICAR 2025 ☆351 Updated 3 months ago