arshad-yaseen / ocr-llmLinks
⚡️ Fast, ultra-accurate text extraction from any image or PDF—including challenging ones—with structured markdown output powered by vision models.
☆37Updated last year
Alternatives and similar repositories for ocr-llm
Users that are interested in ocr-llm are comparing it to the libraries listed below
Sorting:
- Parse PDFs into markdown using Vision LLMs☆455Updated 3 months ago
- Co-create PowerPoint slide decks with AI☆306Updated last week
- Implementing OCR with a local visual model run by ollama.☆296Updated last year
- Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…☆105Updated last year
- Extract structured text from pdfs quickly☆648Updated 7 months ago
- Checkbox Detection Model for Scanned Documents☆90Updated 10 months ago
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int…☆1,114Updated 2 months ago
- OCR Benchmark☆604Updated 2 months ago
- PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Lev…☆43Updated last year
- 🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for extracting structured tabular data from i…☆78Updated 10 months ago
- End-to-End Local-First Text-to-SQL Pipelines☆426Updated 10 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆159Updated 4 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,342Updated 3 weeks ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆146Updated 5 months ago
- UniTable: Towards a Unified Table Foundation Model☆519Updated last year
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.☆98Updated 11 months ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,828Updated 4 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆292Updated 6 months ago
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.☆125Updated 3 years ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,473Updated 4 months ago
- Simple package to extract text with coordinates from programmatic PDFs☆226Updated last month
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆223Updated 9 months ago
- PDFStract - PDF Data Extraction & Benchmarking for RAG and AI Pipelines - Available as CLI - WEBUI - API☆94Updated 2 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆491Updated 5 months ago
- ☆104Updated this week
- A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equat…☆167Updated 5 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int…☆736Updated 10 months ago
- Open-Source RAG app with LLM Observability (Langfuse), support for 100+ providers (LiteLLM), Dockerized, Full Type-checking, 100% Test co…☆157Updated 3 weeks ago
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted…☆180Updated 6 months ago
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆627Updated 7 months ago