opendatalab / OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
☆244Updated last week
Alternatives and similar repositories for OmniDocBench:
Users that are interested in OmniDocBench are comparing it to the libraries listed below
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆205Updated 2 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆138Updated 5 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆139Updated 8 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆65Updated 3 weeks ago
- Document Artifical Intelligence☆146Updated 2 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆167Updated 8 months ago
- UniTable: Towards a Unified Table Foundation Model☆432Updated 8 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆159Updated 8 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆324Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆220Updated 5 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆857Updated last month
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆184Updated 4 months ago
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆258Updated 2 weeks ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆73Updated 3 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆215Updated this week
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)☆351Updated last month
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆114Updated 3 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆430Updated last month
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆21Updated 2 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆176Updated this week
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆160Updated 5 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆228Updated 7 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆268Updated last month
- Parsing-free RAG supported by VLMs☆593Updated this week
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆77Updated last month
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆201Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation