getomni-ai / zeroxLinks
OCR & Document Extraction using vision models
☆12,041Updated 8 months ago
Alternatives and similar repositories for zerox
Users that are interested in zerox are comparing it to the libraries listed below
Sorting:
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,159Updated 3 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,064Updated 11 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,798Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,143Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,840Updated last week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,788Updated last month
- Convert PDF to markdown + JSON quickly with high accuracy☆31,237Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,269Updated 11 months ago
- 🪄 Create rich visualizations with AI☆14,789Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,708Updated 8 months ago
- A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.☆10,377Updated last week
- Document to Markdown OCR library with Llama 3.2 vision☆2,422Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,969Updated last month
- Simple, unified interface to multiple Generative AI providers☆13,394Updated last month
- Python scraper based on AI☆22,357Updated last week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,314Updated 2 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆52,995Updated this week
- Turn websites into clean data pipelines & structured APIs in minutes!☆14,164Updated last week
- A simple screen parsing tool towards pure vision based GUI agent☆24,265Updated 4 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,750Updated 6 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,932Updated 4 months ago
- An open-source RAG-based tool for chatting with your documents.☆24,873Updated 6 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,731Updated 3 weeks ago
- A language model programming library.☆5,877Updated 7 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,662Updated 6 months ago
- Using GPT to parse PDF☆3,558Updated 9 months ago
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,248Updated last year
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,817Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆34,405Updated this week
- An AI-powered search engine with a generative UI☆8,508Updated last month