Hritikbansal / entigen_emnlp
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
☆13Updated last year
Related projects: ⓘ
- ☆45Updated 10 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆11Updated last month
- ☆30Updated 11 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆42Updated 9 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆22Updated last month
- ☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆24Updated 3 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆29Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆32Updated 6 months ago
- This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness …☆19Updated last year
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆24Updated 9 months ago
- NegCLIP.☆23Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆22Updated last month
- Code for Debiasing Vision-Language Models via Biased Prompts☆50Updated last year
- Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023☆107Updated last year
- ☆24Updated 11 months ago
- ☆46Updated last year
- ☆72Updated 5 months ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆81Updated last year
- ☆55Updated 11 months ago
- [Arxiv] Calibrated Self-Rewarding Vision Language Models☆35Updated 3 months ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆51Updated last year
- PHASE annotations for societal bias in vision-and-language tasks.☆15Updated 3 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆132Updated 4 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆41Updated last month
- ☆30Updated 7 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆31Updated last year
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆24Updated 10 months ago
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆31Updated 2 weeks ago
- Data repository for the VALSE benchmark.☆34Updated 7 months ago
- ☆25Updated 4 months ago