season1blue / DWE
Official implementation of AAAI24 paper "A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking"
☆8Updated 7 months ago
Alternatives and similar repositories for DWE:
Users that are interested in DWE are comparing it to the libraries listed below
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 5 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆13Updated 10 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆11Updated 6 months ago
- [EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering☆17Updated 6 months ago
- ☆11Updated 6 months ago
- ☆27Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆21Updated 4 months ago
- ☆14Updated 10 months ago
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective.☆19Updated 7 months ago
- ☆11Updated 3 months ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆14Updated 6 months ago
- Official pytorch implementation of 'Relation-aware Language-Graph Transformer for Question Answering' (AAAI 2023)☆17Updated 2 years ago
- An automatic MLLM hallucination detection framework☆19Updated last year
- Source code for InBedder, an instruction-following text embedder☆24Updated 6 months ago
- [NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models☆41Updated last year
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆13Updated 9 months ago
- Original VinVL (and Oscar) repo with API designed for an easy inference☆8Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆41Updated 6 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆37Updated 5 months ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆17Updated 2 months ago
- ☆18Updated 5 months ago
- ☆12Updated 4 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆19Updated last year
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆16Updated 9 months ago
- ☆17Updated 8 months ago
- Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models"☆14Updated 9 months ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆14Updated 6 months ago
- [KDD 2023] Multi-Grained Multimodal Interaction Network for Entity Linking☆26Updated last year
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆11Updated 6 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆27Updated last month