OCR4all / OCR4allLinks
Provides OCR (Optical Character Recognition) services through web applications
☆698Updated last year
Alternatives and similar repositories for OCR4all
Users that are interested in OCR4all are comparing it to the libraries listed below
Sorting:
- Visualise your CSV files in seconds without sending your data anywhere☆514Updated this week
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.☆259Updated 8 months ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.☆590Updated 4 months ago
- A Python library to inspect and modify the internal structure of a PDF file☆1,008Updated last month
- Examples and guides for using the VLM Run API☆292Updated last week
- Open-source platform for extracting structured data from documents using AI.☆1,429Updated 4 months ago
- Safe, Open, High-Performance — PDF for AI☆687Updated this week
- Multimodal RAG to search and interact locally with technical documents of any kind☆252Updated this week
- A fully static distributed library system powered by IPFS, SQLite and GitHub☆525Updated 9 months ago
- A hub for various industry-specific schemas to be used with VLMs.☆535Updated 4 months ago
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is…☆303Updated last year
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit…☆479Updated 3 months ago
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust☆1,742Updated 3 weeks ago
- ☆280Updated 4 months ago
- CleverBee - The Open Source Deep Researcher Tool☆307Updated 4 months ago
- grep for words with similar meaning to the query☆1,183Updated last year
- Loki: Open-source solution designed to automate the process of verifying factuality☆1,116Updated last year
- Transcribe PDFs with local LLMs☆727Updated last week
- Note as HTML☆287Updated last month
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆720Updated last week
- tail -f your gmail☆426Updated 2 months ago
- Fully neural approach for text chunking☆374Updated 5 months ago
- Algolia alternative for technical docs☆584Updated 11 months ago
- Transform JSON objects using vector embeddings☆427Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated last year
- OCR engine for all the languages☆894Updated this week
- This is a python implementation for stitching images.☆233Updated last year
- A web content preservation service☆427Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆397Updated last year
- Document Layout Analysis☆387Updated this week