rednote-hilab / dots.ocrLinks

Multilingual Document Layout Parsing in a Single Vision-Language Model

☆7,139

Alternatives and similar repositories for dots.ocr

Users that are interested in dots.ocr are comparing it to the libraries listed below

Sorting:

NanoNets / docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
☆1,851Updated 5 months ago
bytedance / Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆8,774Updated last month
chatdoc-com / OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…
☆2,480Updated 6 months ago
chonkie-inc / chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
☆3,727Updated this week
Yuliang-Liu / MonkeyOCR
A lightweight LMM-based Document Parsing Model
☆6,461Updated this week
shcherbak-ai / contextgem
ContextGem: Effortless LLM extraction from documents
☆1,777Updated last month
opendatalab / OmniDocBench
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,446Updated last month
allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆16,860Updated this week
opendatalab / DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆1,981Updated 9 months ago
lumina-ai-inc / chunkr
Vision infrastructure to turn complex documents into RAG/LLM-ready data
☆2,935Updated 4 months ago
VectifyAI / PageIndex
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
☆14,072Updated 2 weeks ago
datalab-to / chandra
OCR model that handles complex tables, forms, handwriting with full layout.
☆4,733Updated 3 weeks ago
HKUDS / RAG-Anything
"RAG-Anything: All-in-One RAG Framework"
☆12,776Updated 2 weeks ago
opendatalab / PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,203Updated last year
huridocs / pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…
☆1,075Updated last month
imanoop7 / Ollama-OCR
☆2,112Updated 10 months ago
landing-ai / agentic-doc
Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects.
☆2,354Updated this week
tjmlabs / ColiVara
Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…
☆1,448Updated 9 months ago
Tencent-Hunyuan / HunyuanOCR
☆1,523Updated 3 weeks ago
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,073Updated last year
jina-ai / node-DeepResearch
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
☆5,076Updated last month
google / langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…
☆24,058Updated last month
microsoft / PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
☆3,753Updated 3 months ago
deepseek-ai / DeepSeek-OCR-2
Visual Causal Flow
☆2,011Updated last week
zilliztech / deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
☆7,563Updated 2 months ago
getomni-ai / zerox
OCR & Document Extraction using vision models
☆12,070Updated 8 months ago
rdumasia303 / deepseek_ocr_app
A quick vibe coded app for deepseek OCR
☆1,714Updated 2 months ago
NVIDIA / nv-ingest
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…
☆2,840Updated this week
microsoft / KBLaM
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
☆1,436Updated 3 months ago
Alibaba-NLP / DeepResearch
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆18,165Updated 2 weeks ago