OCR4all / OCR4allLinks
Provides OCR (Optical Character Recognition) services through web applications
☆702Updated last year
Alternatives and similar repositories for OCR4all
Users that are interested in OCR4all are comparing it to the libraries listed below
Sorting:
- PDF Parsing for RAG — Convert to Markdown & JSON, Fast, Local, No GPU☆821Updated this week
- A Python library to inspect and modify the internal structure of a PDF file☆1,011Updated 5 months ago
- Visualise your CSV files in seconds without sending your data anywhere☆516Updated 3 weeks ago
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.☆267Updated 11 months ago
- Examples and guides for using the VLM Run API☆304Updated 2 weeks ago
- A fully static distributed library system powered by IPFS, SQLite and GitHub☆529Updated last year
- Transcribe PDFs with local LLMs☆818Updated last month
- Multimodal RAG to search and interact locally with technical documents of any kind☆284Updated 2 months ago
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit…☆478Updated 7 months ago
- A web content preservation service☆581Updated 3 weeks ago
- A hub for various industry-specific schemas to be used with VLMs.☆538Updated last month
- OCR engine for all the languages☆929Updated this week
- Transform JSON objects using vector embeddings☆429Updated last year
- Fully neural approach for text chunking☆406Updated 2 months ago
- tail -f your gmail☆438Updated last month
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆682Updated 7 months ago
- Open-source platform for extracting structured data from documents using AI.☆1,462Updated 8 months ago
- ☆280Updated 7 months ago
- CLI app- Give it a YouTube URL and you get a transcription with possible speaker identification and optional summary or translation, all …☆330Updated last month
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is…☆304Updated last year
- Document Layout Analysis☆393Updated 3 weeks ago
- AI Dataset Generator – Create realistic datasets for demos, learning, and dashboards☆745Updated 3 months ago
- A tool to detect whether a PDF has a bad redaction☆765Updated last week
- Collection of OCR-related python tools and wrappers from @OCR-D☆133Updated last week
- Loki: Open-source solution designed to automate the process of verifying factuality☆1,129Updated last year
- Markdown blog with GitHub Pages☆161Updated last month
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆194Updated 2 months ago
- Note as HTML☆285Updated 2 months ago
- Linux Bash Script for the Paranoid Admin on a Budget - real-time monitoring and active threat response☆580Updated last week
- CleverBee - The Open Source Deep Researcher Tool☆309Updated 7 months ago