OCR4all / OCR4all
Provides OCR (Optical Character Recognition) services through web applications
☆678Updated last year
Alternatives and similar repositories for OCR4all:
Users that are interested in OCR4all are comparing it to the libraries listed below
- Crawls a Multi-Page Application to a zip file, serve the Multi-Page Application from the zip file. A MPA archiver. Could be used as a Sit…☆475Updated 6 months ago
- Visualise your CSV files in seconds without sending your data anywhere☆505Updated 3 weeks ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆185Updated 4 months ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.☆538Updated 2 months ago
- Document Layout Analysis☆365Updated this week
- A hub for various industry-specific schemas to be used with VLMs.☆496Updated last week
- guides and test data for OCR4all☆30Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆389Updated 8 months ago
- A fully static distributed library system powered by IPFS, SQLite and GitHub☆520Updated 3 months ago
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust☆1,664Updated 3 weeks ago
- Examples and guides for using the VLM Run API☆271Updated 3 weeks ago
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.☆245Updated 2 months ago
- Algolia alternative for technical docs☆510Updated 5 months ago
- A Python library to inspect and modify the internal structure of a PDF file☆983Updated last week
- Visual tool to explore SQLite databases page-by-page, the way they're stored on disk and the way SQLite sees them.☆621Updated 4 months ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆128Updated 2 weeks ago
- Migrate from Docker to Podman.☆358Updated 2 weeks ago
- CLI tool and python library to inspect databases fast.☆485Updated 2 months ago
- Gantt charts with only Javascript, CSS, HTML and YAML☆214Updated 3 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆582Updated last week
- A Command-Line Utility to automatically backup Google Mail, Calendar & Contacts to local files.☆404Updated 2 weeks ago
- WireQuery is the first full-stack session replay and API call exploration tool. Using WireQuery, you get a holistic overview of how an is…☆302Updated 9 months ago
- Open-source personal bookmarks search engine☆619Updated this week
- Fully neural approach for text chunking☆26Updated last week
- Piping logs, visualising on a web app – just suffix "| npx logscreen"☆453Updated last year
- LLM Chain querying a scientific Zotero library, with citations☆424Updated last year
- (Cross-Platform) An open source approach to locally record and enable searching everything you view on any computer.☆274Updated 11 months ago
- AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and bey…☆720Updated 2 months ago
- Note taking for developers and power users☆423Updated last week
- A deep learning toolkit specialized for handwritten document analysis☆234Updated 7 months ago