nttmdlab-nlp / VDocRAG
[CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
☆17Updated 2 weeks ago
Alternatives and similar repositories for VDocRAG
Users who are interested in VDocRAG are comparing it to the libraries listed below:
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆18Updated 3 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆30Updated last month
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆29Updated last month
- Control LLM☆14Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 5 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆14Updated 6 months ago
- ☆32Updated 3 months ago
- ☆48Updated 2 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆73Updated 6 months ago
- ☆16Updated 9 months ago
- ☆56Updated 3 weeks ago
- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆34Updated 4 months ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs☆11Updated last month
- A Survey of Multimodal Retrieval-Augmented Generation☆18Updated 3 weeks ago
- ☆17Updated 3 months ago
- ☆95Updated last month
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆18Updated last month
- ☆74Updated last week
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering☆22Updated 6 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆20Updated 3 months ago
- ☆16Updated last week
- Official PyTorch Implementation of Self-emerging Token Labeling☆33Updated last year
- Survey of small language models☆15Updated 9 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆14Updated this week
- Code for paper: Unified Text-to-Image Generation and Retrieval☆15Updated 10 months ago
- ☆27Updated 3 months ago
- ☆38Updated this week
- The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"☆61Updated last week