simran-khanuja / image-transcreation
☆16 Updated last month
Related projects
Alternatives and complementary repositories for image-transcreation
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models ☆11 Updated 3 weeks ago
- ☆25 Updated 2 weeks ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings ☆53 Updated 5 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022 ☆30 Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆34 Updated 8 months ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),… ☆13 Updated 10 months ago
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners" ☆12 Updated last month
- ☆15 Updated 2 years ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models ☆43 Updated 5 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022) ☆54 Updated last year
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) ☆14 Updated 10 months ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 Updated last year
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat… ☆43 Updated 3 months ago
- Can LLMs generate code-mixed sentences through zero-shot prompting? ☆11 Updated last year
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua… ☆15 Updated last year
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week. ☆34 Updated 3 years ago
- ☆24 Updated last year
- ☆14 Updated 2 years ago
- Retrieval-augmented Image Captioning ☆12 Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023. ☆72 Updated last year
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild ☆47 Updated last year
- Code repository for the paper "Mission: Impossible Language Models." ☆39 Updated 10 months ago
- ☆11 Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆19 Updated 8 months ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents" ☆16 Updated 2 months ago
- ☆12 Updated 8 months ago
- ☆126 Updated 2 years ago
- Repository for the paper: Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models ☆25 Updated 11 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑 ☆13 Updated 7 months ago
- Code for Zero-Shot Tokenizer Transfer ☆115 Updated last month