OCR4all / OCR4allLinks
Provides OCR (Optical Character Recognition) services through web applications
☆704Updated 2 years ago
Alternatives and similar repositories for OCR4all
Users that are interested in OCR4all are comparing it to the libraries listed below
Sorting:
- Visualise your CSV files in seconds without sending your data anywhere☆516Updated last week
- Open-source platform for extracting structured data from documents using AI.☆1,464Updated 8 months ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.☆611Updated 8 months ago
- A hub for various industry-specific schemas to be used with VLMs.☆539Updated last month
- PDF Parsing for RAG — Convert to Markdown & JSON, Fast, Local, No GPU☆847Updated this week
- A Python library to inspect and modify the internal structure of a PDF file☆1,011Updated 5 months ago
- CleverBee - The Open Source Deep Researcher Tool☆310Updated last week
- Fully neural approach for text chunking☆406Updated 3 months ago
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.☆268Updated last year
- Examples and guides for using the VLM Run API☆305Updated 2 weeks ago
- A fully static distributed library system powered by IPFS, SQLite and GitHub☆528Updated last year
- Multimodal RAG to search and interact locally with technical documents of any kind☆283Updated 3 weeks ago
- Transcribe PDFs with local LLMs☆818Updated 2 weeks ago
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit…☆477Updated 7 months ago
- ☆279Updated 8 months ago
- Transform JSON objects using vector embeddings☆429Updated last year
- Note as HTML☆289Updated 3 months ago
- A web content preservation service☆588Updated 3 weeks ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆746Updated 4 months ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆752Updated 2 weeks ago
- tail -f your gmail☆438Updated 2 months ago
- CLI app- Give it a YouTube URL and you get a transcription with possible speaker identification and optional summary or translation, all …☆330Updated last month
- grep for words with similar meaning to the query☆1,203Updated last year
- A tool to detect whether a PDF has a bad redaction☆785Updated 3 weeks ago
- Index your Gmail account to a SQLite DB and play with the data.☆1,217Updated 7 months ago
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is…☆304Updated last year
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆883Updated 2 months ago
- Algolia alternative for technical docs☆594Updated last year
- A series of top performing Text to SQL LLMs☆868Updated last year
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust☆1,830Updated this week