getomni-ai / zeroxLinks
OCR & Document Extraction using vision models
☆11,985Updated 6 months ago
Alternatives and similar repositories for zerox
Users that are interested in zerox are comparing it to the libraries listed below
Sorting:
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,959Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training☆16,165Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆30,314Updated 3 weeks ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,989Updated 11 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,463Updated 7 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,030Updated 10 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,241Updated 9 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,748Updated 6 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,946Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,416Updated 10 months ago
- An open-source RAG-based tool for chatting with your documents.☆24,745Updated 5 months ago
- Turn any website into clean data pipelines, APIs & spreadsheets in minutes☆14,019Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆50,272Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,787Updated 9 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆16,143Updated this week
- Get your documents ready for gen AI☆45,950Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,915Updated 2 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,697Updated 2 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆23,968Updated 3 months ago
- Using GPT to parse PDF☆3,558Updated 7 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,774Updated last week
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆31,976Updated this week
- Open-source Rust based AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summariz…☆8,617Updated last week
- Vibe Workflow Platform for Non-technical Creators.☆4,906Updated this week
- StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language mo…☆4,136Updated last month
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,083Updated 3 weeks ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,359Updated 2 weeks ago
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,…☆30,347Updated 2 weeks ago
- The python library for real-time communication☆4,444Updated 2 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,613Updated 4 months ago