MMGraphRAG is a multi-modal knowledge graph-based framework designed to enhance complex reasoning tasks, such as multi-modal document question-answering. It integrates text and image data into a fine-grained, structured knowledge graph, utilizing scene graphs for image data and a spectral clustering-based fusion module.
☆90Mar 10, 2026Updated 3 months ago
Alternatives and similar repositories for MMGraphRAG
Users that are interested in MMGraphRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆12Dec 29, 2024Updated last year
- ICCV 2025: Official Implematation of "Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced L…☆73Oct 25, 2025Updated 7 months ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)☆16Nov 17, 2024Updated last year
- 此项目旨在为医疗专业人员提供一个高效且准确的工具,用于辅助诊断 COVID-19、普通疾病以及病毒性肺炎。通过结合自编码器的图像增强能力和 CNN 的强大分类性能,该系统力求在医学影像分析领域提供卓越的性能。☆27Dec 24, 2024Updated last year
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆41May 28, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆36Aug 10, 2025Updated 10 months ago
- Main use to store some object trackiing code☆11Sep 1, 2021Updated 4 years ago
- ☆25Nov 27, 2025Updated 6 months ago
- ☆14Feb 26, 2024Updated 2 years ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated 3 months ago
- ☆34Feb 12, 2026Updated 4 months ago
- OpenMediation SDK Server☆16Oct 4, 2022Updated 3 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding☆58May 1, 2026Updated last month
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Apr 18, 2026Updated last month
- ☆40Apr 6, 2026Updated 2 months ago
- Analyze a real-time IPv4 packet stream and export metrics about the data flows☆14Jan 29, 2020Updated 6 years ago
- ☆16Feb 26, 2023Updated 3 years ago
- The official code for paper "UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation"☆37Jul 29, 2024Updated last year
- code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering☆14Aug 13, 2024Updated last year
- ☆15Aug 12, 2022Updated 3 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is a Repository corresponding to ACCV2022 accepted paper ”Complex Handwriting Trajectory Recovery: Evaluation Metrics and Algorithm“…☆14Oct 3, 2022Updated 3 years ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆15Apr 7, 2026Updated 2 months ago
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- ArcFace 3.0的Android Demo☆12Dec 16, 2019Updated 6 years ago
- Vulnerability Knowledge Base comparison tool☆13Feb 9, 2022Updated 4 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- NeurIPS 2024: RAGraph: A General Retrieval-Augmented Graph Learning Framework☆23Feb 4, 2025Updated last year
- ☆28Oct 14, 2024Updated last year
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 使用WebGL和Node.js技术构建复旦三维社交网络。目前实现了校园模型demo显示,多用户在线聊天。☆12Jun 12, 2015Updated 11 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 4 years ago
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization☆18Dec 15, 2025Updated 6 months ago
- A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning☆41Mar 12, 2026Updated 3 months ago
- ☆40Dec 8, 2025Updated 6 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆21Feb 14, 2025Updated last year