opendatalab / OHR-BenchLinks
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆81Updated 2 weeks ago
Alternatives and similar repositories for OHR-Bench
Users that are interested in OHR-Bench are comparing it to the libraries listed below
Sorting:
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆189Updated 3 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 5 months ago
- The All-in-one Judge Models introduced by Opencompass☆96Updated 4 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆36Updated 4 months ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆148Updated 8 months ago
- ☆90Updated 2 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆235Updated 10 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆141Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆215Updated last week
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆45Updated last week
- ☆94Updated 3 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆190Updated 3 weeks ago
- ☆280Updated last month
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆52Updated 7 months ago
- ☆72Updated last month
- Efficient Agent Training for Computer Use☆114Updated last month
- ☆45Updated last month
- ☆56Updated 7 months ago
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers☆69Updated 2 months ago
- ☆94Updated 7 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆114Updated 5 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆161Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- This is the official repository for Auto-RAG.☆212Updated 2 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆151Updated last year
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 8 months ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆108Updated 2 weeks ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆21Updated 5 months ago
- Complex Function Calling Benchmark.☆117Updated 5 months ago