jhliu17 / MCR
Code for the paper "Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews" (ACL 2021)
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for MCR
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆53Updated last year
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Updated 8 months ago
- [ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition☆24Updated 2 months ago
- ☆19Updated 3 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆18Updated 9 months ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆32Updated last year
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆24Updated 5 years ago
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021☆34Updated 2 months ago
- ☆19Updated 2 years ago
- ☆37Updated last year
- Recent Advances in Visual Dialog☆30Updated 2 years ago
- ☆26Updated last year
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆88Updated last year
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆18Updated last year
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆41Updated last year
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆29Updated 5 months ago
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆90Updated 2 years ago
- MSTI☆13Updated 8 months ago
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆16Updated 3 years ago
- Official implementation for CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding☆42Updated last year
- for DTCA model☆8Updated last year
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”☆29Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆33Updated 3 weeks ago
- ☆24Updated 3 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆65Updated 3 years ago
- Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation☆45Updated 3 years ago
- Official implementation of Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement.☆31Updated 11 months ago
- A reading list of papers about Visual Question Answering.☆32Updated 2 years ago
- ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View☆11Updated 8 months ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆31Updated 7 months ago