(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆101 Dec 3, 2025 Updated 5 months ago
Alternatives and similar repositories for OHR-Bench
Users who are interested in OHR-Bench are comparing it to the libraries listed below.
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection" ☆79 Oct 22, 2025 Updated 7 months ago
- ☆25 Nov 7, 2022 Updated 3 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆476 Sep 28, 2025 Updated 7 months ago
- [MM '25] This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts". ☆44 Sep 27, 2025 Updated 7 months ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression". ☆13 Feb 27, 2024 Updated 2 years ago
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024 ☆22 Feb 15, 2024 Updated 2 years ago
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards". ☆24 Oct 28, 2024 Updated last year
- Data Set Description Language Specification (next-generation AI dataset description language, DSDL) ☆46 May 29, 2024 Updated last year
- [ACL 2024 Oral] This is the code repo for our ACL'24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo… ☆39 Jun 30, 2024 Updated last year
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM ☆48 May 24, 2024 Updated 2 years ago
- SDK of OpenDataLab - https://opendatalab.org.cn ☆60 Jul 31, 2025 Updated 9 months ago
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs" ☆77 Feb 9, 2026 Updated 3 months ago
- Official Implementation for Generative Neural Fields by Mixtures of Neural Implicit Functions ☆19 Mar 10, 2024 Updated 2 years ago
- ☆35 Apr 22, 2025 Updated last year
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication ☆19 Mar 11, 2025 Updated last year
- AAAI 2024: Visual Instruction Generation and Correction ☆97 Feb 4, 2024 Updated 2 years ago
- ☆16 Jul 23, 2024 Updated last year
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc… ☆28 Mar 2, 2025 Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆37 Apr 25, 2026 Updated last month
- NanaDraw turns complex scientific ideas into clear, expressive visuals you can use right away. Powered by Nano Banana, it generates edita… ☆96 Apr 29, 2026 Updated 3 weeks ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025) ☆47 Nov 24, 2025 Updated 6 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆2,166 Apr 14, 2025 Updated last year
- Current Alpha version of the ONTO-TRON-5000 ☆41 Dec 1, 2025 Updated 5 months ago
- ☆11 Oct 31, 2024 Updated last year
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings ☆18 Oct 17, 2025 Updated 7 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs. ☆56 Mar 12, 2026 Updated 2 months ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis ☆16 Jan 13, 2022 Updated 4 years ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆36 Aug 28, 2025 Updated 8 months ago
- Generative AI Governance for Enterprises ☆16 Dec 29, 2024 Updated last year
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?" ☆51 Jun 3, 2025 Updated 11 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri… ☆18 Mar 13, 2025 Updated last year
- ☆16 May 22, 2025 Updated last year
- ☆11 Jul 28, 2023 Updated 2 years ago
- this repository provides the agentic rag powered by crewai and qdrant and applied for health care industry. ☆18 Jan 11, 2025 Updated last year
- ☆12 Jul 13, 2023 Updated 2 years ago
- DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors ☆37 Sep 13, 2024 Updated last year
- ☆25 Jul 20, 2025 Updated 10 months ago
- Parsing-free RAG supported by VLMs ☆956 Dec 7, 2025 Updated 5 months ago
- Promtless-TaskSpecific-Finetuning of MetaAI Segment-Anything Model ☆11 Jan 1, 2024 Updated 2 years ago