The code and datasets for our ACL 2024 Main Conference paper, "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment"
☆17 Jan 24, 2025 Updated last year
Alternatives and similar repositories for Cognitive-Visual-Language-Mapper
Users that are interested in Cognitive-Visual-Language-Mapper are comparing it to the libraries listed below
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C… ☆17 May 27, 2024 Updated last year
- ☆30 Dec 16, 2022 Updated 3 years ago
- ☆11 Feb 19, 2022 Updated 4 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution" ☆10 Nov 1, 2022 Updated 3 years ago
- Study notes ☆11 Oct 30, 2024 Updated last year
- ☆12 Jul 4, 2024 Updated last year
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog" ☆13 May 12, 2023 Updated 2 years ago
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024) ☆13 Apr 28, 2025 Updated 10 months ago
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions" ☆11 Apr 19, 2022 Updated 3 years ago
- ☆10 Dec 26, 2023 Updated 2 years ago
- [CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification ☆10 Mar 20, 2023 Updated 2 years ago
- [ICCV 2021] Multimodal Knowledge Expansion ☆10 Aug 28, 2021 Updated 4 years ago
- ☆10 Jul 18, 2023 Updated 2 years ago
- paper code commit-fsmafl ☆10 Mar 18, 2024 Updated last year
- Official repository for the paper, "FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data", EMNLP 2025 Main… ☆15 Nov 11, 2025 Updated 3 months ago
- This repository contains the dataset of the paper ARGUS: Context-Based Detection of Stealthy IoT Infiltration Attacks ☆12 Apr 28, 2023 Updated 2 years ago
- ☆11 Jul 30, 2025 Updated 7 months ago
- A curated publication list on visual dialog ☆14 May 8, 2023 Updated 2 years ago
- List of papers on Hallucination in LMM ☆10 Nov 29, 2023 Updated 2 years ago
- Attaching human-like eyes to the large language model. The codes of IEEE TMM paper "LMEye: An Interactive Perception Network for Large La… ☆49 Jul 18, 2024 Updated last year
- The code for the paper "A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering" ☆13 Aug 22, 2023 Updated 2 years ago
- The example of correspondence between fine classes and superclasses (coarse classes) in ImageNet. ☆13 Dec 4, 2024 Updated last year
- ☆10 Aug 19, 2023 Updated 2 years ago
- Offline RL experiments ☆15 Oct 1, 2022 Updated 3 years ago
- ☆14 Mar 28, 2025 Updated 11 months ago
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration". ☆14 Nov 18, 2024 Updated last year
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation ☆13 Dec 4, 2023 Updated 2 years ago
- Dronet, adapted for Pytorch. ☆11 Oct 21, 2025 Updated 4 months ago
- (CVPR 2024) Official Implementation of "FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning" ☆15 Jun 28, 2024 Updated last year
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering ☆13 Nov 23, 2022 Updated 3 years ago
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning ☆16 Feb 17, 2025 Updated last year
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆14 Dec 16, 2024 Updated last year
- End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP ☆16 Sep 23, 2024 Updated last year
- Learn NumPy while drawing ☆13 Mar 27, 2022 Updated 3 years ago
- ☆18 Feb 8, 2024 Updated 2 years ago
- Benchmark and analysis of 165 pretrained SSL models. Code for "Evaluating Self-Supervised Learning via Risk Decomposition". ☆15 Jul 26, 2023 Updated 2 years ago
- Official codes for FNBench: Benchmarking Robust Federated Learning against Noisy Labels ☆18 Updated this week
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models" ☆15 Oct 12, 2023 Updated 2 years ago
- Conformal Prediction + Federated Learning ☆14 Mar 16, 2024 Updated last year