opendatalab / OHR-Bench
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆86 Updated last month
Alternatives and similar repositories for OHR-Bench
Users who are interested in OHR-Bench are comparing it to the repositories listed below
- An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanj… ☆44 Updated 9 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆238 Updated last year
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding ☆201 Updated 2 weeks ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆147 Updated 9 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆232 Updated 3 weeks ago
- This is the official repository for Auto-RAG. ☆218 Updated last month
- The All-in-one Judge Models introduced by Opencompass ☆110 Updated last month
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆143 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆224 Updated 2 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ☆111 Updated 6 months ago
- ☆292 Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆216 Updated last week
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft. ☆58 Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆85 Updated 7 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆306 Updated 3 weeks ago
- Open replication of DeepSeek R1 for text-to-graph extraction. ☆98 Updated 6 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts". ☆37 Updated last week
- ☆90 Updated 3 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents". ☆107 Updated 10 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆161 Updated last year
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025] ☆115 Updated 7 months ago
- ACL 2025: Synthetic data generation pipelines for text-rich images. ☆133 Updated 5 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆101 Updated 2 months ago
- ☆94 Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search ☆53 Updated last month
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers ☆69 Updated 3 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆193 Updated 2 months ago
- [Up-to-date] Awesome RAG Reasoning Resources ☆257 Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. ☆305 Updated this week
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search" ☆63 Updated last month