bloomberg/m3docrag

bloomberg / m3docrag

☆71

Alternatives and similar repositories for m3docrag

Users who are interested in m3docrag are comparing it to the repositories listed below.

nttmdlab-nlp / VDocRAG
[CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
☆66 • May 26, 2025 • Updated last year
puar-playground / Self-Visual-RAG
Implementation of MLLM-based Self-Vision-RAG models
☆15 • Nov 30, 2025 • Updated 7 months ago
Omaralsaabi / M3DOCRAG
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanj…
☆56 • Nov 13, 2024 • Updated last year
MRAMG-Bench / MRAMG
[SIGIR 2025] Official impl. of "MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation…
☆19 • Apr 15, 2025 • Updated last year
maziao / M2RAG
Implementation of "Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines"
☆33 • Feb 24, 2025 • Updated last year
mayubo2333 / MMLongBench-Doc
Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
☆149 • Sep 28, 2025 • Updated 9 months ago
aiming-lab / MDocAgent
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding
☆352 • Aug 8, 2025 • Updated 11 months ago
SalesforceAIResearch / UniDoc-Bench
☆38 • Jun 2, 2026 • Updated last month
MaYufei-NPU / InfoGain-RAG
Implementation of EMNLP Oral Paper: InfoGain-RAG: Boosting Retrieval-Augmented Generation through Document Information Gain-based Reranki…
☆18 • Sep 17, 2025 • Updated 10 months ago
Gabesarch / ICAL
☆53 • May 11, 2025 • Updated last year
OpenBMB / VisRAG
Parsing-free RAG supported by VLMs
☆972 • Jul 17, 2026 • Updated last week
Ziyang412 / Video-RTS
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24 • Feb 18, 2026 • Updated 5 months ago
MananSuri27 / VisDoM
☆45 • Jul 28, 2025 • Updated 11 months ago
DataArcTech / RagVL
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …
☆92 • Nov 15, 2024 • Updated last year
mattf1n / basis-aware-threshold
Code for the paper "Closing the Curious Case of Neural Text Degeneration"
☆12 • Apr 9, 2025 • Updated last year
cqu-student / Wiki-PRF
☆19 • Mar 9, 2026 • Updated 4 months ago
llm-lab-org / Multimodal-RAG-Survey
A Survey on Multimodal Retrieval-Augmented Generation
☆533 • Feb 20, 2026 • Updated 5 months ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆50 • Nov 13, 2023 • Updated 2 years ago
illuin-tech / modernvbert
ModernVBERT is a 250M-parameter vision–language encoder that aligns a text-encoder (Ettin-150M) with a vision-encoder (SigLIP2-B) through…
☆16 • Oct 16, 2025 • Updated 9 months ago
gaojingsheng / SmartRAG
Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025)
☆18 • Feb 17, 2025 • Updated last year
john-hewitt / truncation-sampling
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆13 • Dec 6, 2022 • Updated 3 years ago
jaehong31 / RACCooN
(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
☆37 • Dec 20, 2025 • Updated 7 months ago
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,706 • Jul 13, 2026 • Updated last week
aimagelab / ReT-2
Recurrence Meets Transformers for Universal Multimodal Retrieval
☆15 • Dec 15, 2025 • Updated 7 months ago
Alibaba-NLP / VRAG
Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.
☆970 • Apr 29, 2026 • Updated 2 months ago
amazon-science / adaptive-in-context-learning
AdaICL: Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection
☆19 • Oct 30, 2023 • Updated 2 years ago
TL-UESTC / DAPM
A PyTorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation
☆16 • Nov 28, 2023 • Updated 2 years ago
illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆278 • Mar 25, 2026 • Updated 3 months ago
ZhangShiyue / extractive_is_not_faithful
☆17 • May 19, 2023 • Updated 3 years ago
daeunni / VideoRepair
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52 • Apr 7, 2026 • Updated 3 months ago
yh-hust / PDF-Wukong
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
☆131 • Jun 4, 2025 • Updated last year
hljoren / sufficientcontext
Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"
☆69 • Jul 22, 2025 • Updated last year
ZhengboZhang / VisBrowse-Bench
Official data and code for the paper "VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents".
☆15 • Mar 18, 2026 • Updated 4 months ago
StibiumT16 / Robust-Fine-tuning
Code for Robust Fine-tuning (RbFT)
☆19 • Jan 31, 2025 • Updated last year
manga109 / public-annotations
Various annotations of Manga109 dataset
☆13 • Apr 23, 2025 • Updated last year
wgcyeo / UniversalRAG
[ACL 2026 Oral] UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities
☆174 • Jun 24, 2026 • Updated last month
1529835657 / CSU-Library
CSU check-in, temporary leave, and check-out assistant
☆12 • Aug 27, 2022 • Updated 3 years ago
MrZilinXiao / AutoVER
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14 • Mar 2, 2024 • Updated 2 years ago
LoieSun / Auto-ACD
Code for "A Large-scale Dataset for Audio-Language Representation Learning"
☆14 • Sep 18, 2024 • Updated last year