opendatalab / OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
☆135 Updated last week
Alternatives and similar repositories for OmniDocBench:
Users who are interested in OmniDocBench are comparing it to the repositories listed below
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆35 Updated this week
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆151 Updated 6 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding" ☆133 Updated 6 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence ☆128 Updated 3 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆148 Updated last week
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets. ☆53 Updated 2 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆67 Updated 3 weeks ago
- The forthcoming work. ☆158 Updated last week
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆203 Updated 3 months ago
- TF-ID: Table/Figure IDentifier for academic papers ☆224 Updated 5 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆126 Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling) ☆273 Updated 2 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ☆86 Updated last month
- ☆126 Updated 10 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy. ☆268 Updated this week
- Convert documents (pdf/html/doc/docx/ppt/pptx) to Markdown ☆35 Updated 4 months ago
- Document Artificial Intelligence ☆133 Updated last week
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆267 Updated last month
- Expert Specialized Fine-Tuning ☆150 Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆187 Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆330 Updated 6 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆117 Updated 3 weeks ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆139 Updated last month
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆185 Updated last month
- Parsing-free RAG supported by VLMs ☆491 Updated this week
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train … ☆169 Updated 2 months ago
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems ☆261 Updated this week
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆165 Updated this week
- ☆78 Updated 3 weeks ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token" ☆206 Updated last week