opendatalab / OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
☆288Updated 3 weeks ago
Alternatives and similar repositories for OmniDocBench:
Users that are interested in OmniDocBench are comparing it to the libraries listed below
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆221Updated 3 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆141Updated 9 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆352Updated 3 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆286Updated 2 weeks ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆141Updated 6 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆188Updated 10 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆219Updated last month
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆954Updated 2 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆227Updated 6 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆68Updated last week
- Document Artifical Intelligence☆155Updated 3 months ago
- UniTable: Towards a Unified Table Foundation Model☆445Updated 9 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆189Updated 5 months ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆167Updated 6 months ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆240Updated 2 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆80Updated 4 months ago
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)☆378Updated 2 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆229Updated 8 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆268Updated 6 months ago
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆50Updated 4 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆160Updated 9 months ago
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆282Updated this week
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆571Updated last month
- Parsing-free RAG supported by VLMs☆636Updated last month
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆130Updated last week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆480Updated 2 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆185Updated 3 weeks ago
- ☆131Updated last year
- Analysis of Chinese and English layouts 中英文版面分析☆181Updated last month
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆115Updated 5 months ago