Wings-Of-Disaster / VaLiK
Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning
☆20 Updated this week
Alternatives and similar repositories for VaLiK
Users who are interested in VaLiK are comparing it to the repositories listed below
- ☆14 Updated last year
- ☆19 Updated 3 months ago
- [ACL 2024] This is the code repo for our ACL'24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module … ☆35 Updated 10 months ago
- [ACL2024] Progressively Modality Freezing for Multi-Modal Entity Alignment ☆16 Updated last month
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search". ☆21 Updated 2 weeks ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025 ☆18 Updated 3 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio… ☆20 Updated 3 months ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning ☆35 Updated 11 months ago
- Using image captions with LLM for zero-shot VQA ☆17 Updated last year
- ☆14 Updated 2 weeks ago
- [NeurIPS 2023] DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models ☆43 Updated last year
- ☆12 Updated last month
- [Paper][ICLR 2025] Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning ☆27 Updated this week
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge. ☆61 Updated 2 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced … ☆75 Updated 6 months ago
- ☆39 Updated 8 months ago
- ☆37 Updated 2 weeks ago
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆58 Updated last year
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆53 Updated this week
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs ☆12 Updated 4 months ago
- [ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition ☆29 Updated last month
- ☆14 Updated 5 months ago
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning" ☆17 Updated 5 months ago
- [KDD 2023] Multi-Grained Multimodal Interaction Network for Entity Linking ☆26 Updated last year
- [CVPR' 25] Interleaved-Modal Chain-of-Thought ☆40 Updated 3 weeks ago
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024) ☆20 Updated 9 months ago
- Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models? ☆24 Updated 2 months ago
- [Paper][AAAI2024] Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations ☆138 Updated 10 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU ☆46 Updated last year
- ☆46 Updated last year