rednote-hilab / dots.ocr
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆7,139 Updated last month
Alternatives and similar repositories for dots.ocr
Users who are interested in dots.ocr are comparing it to the libraries listed below.
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,851 Updated 5 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆8,774 Updated last month
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay… ☆2,480 Updated 6 months ago
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines ☆3,727 Updated this week
- A lightweight LMM-based Document Parsing Model ☆6,461 Updated this week
- ContextGem: Effortless LLM extraction from documents ☆1,777 Updated last month
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation ☆1,446 Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,860 Updated this week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆1,981 Updated 9 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,935 Updated 4 months ago
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG ☆14,072 Updated 2 weeks ago
- OCR model that handles complex tables, forms, handwriting with full layout. ☆4,733 Updated 3 weeks ago
- "RAG-Anything: All-in-One RAG Framework" ☆12,776 Updated 2 weeks ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆9,203 Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic… ☆1,075 Updated last month
- ☆2,112 Updated 10 months ago
- Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects. ☆2,354 Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,448 Updated 9 months ago
- ☆1,523 Updated 3 weeks ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,073 Updated last year
- Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget) ☆5,076 Updated last month
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆24,058 Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,753 Updated 3 months ago
- Visual Causal Flow ☆2,011 Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,563 Updated 2 months ago
- OCR & Document Extraction using vision models ☆12,070 Updated 8 months ago
- A quick vibe coded app for deepseek OCR ☆1,714 Updated 2 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra… ☆2,840 Updated this week
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model" ☆1,436 Updated 3 months ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent ☆18,165 Updated 2 weeks ago