tuhinjubcse / VisualMetaphors
Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
☆11Updated last year
Alternatives and similar repositories for VisualMetaphors:
Users that are interested in VisualMetaphors are comparing it to the libraries listed below
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Updated last year
- ☆33Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆32Updated 2 months ago
- The git repository of Modular Prompted Chatbot paper☆32Updated last year
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆13Updated 3 years ago
- Corpus to accompany: "Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest"☆55Updated 2 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- ☆34Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated last year
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆35Updated 4 months ago
- Neuron Activation☆23Updated 2 months ago
- ☆31Updated 11 months ago
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆11Updated 5 months ago
- Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text☆24Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆20Updated last month
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆80Updated last month
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆15Updated last year
- The SVO-Probes Dataset for Verb Understanding☆31Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 7 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆32Updated last year
- ☆28Updated 11 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆43Updated 5 months ago
- Preference Learning for LLaVA☆29Updated 2 months ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆20Updated 2 years ago
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆11Updated 2 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆19Updated last year
- Code, data, models for the Sherlock corpus☆55Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆41Updated 2 months ago