simran-khanuja / image-transcreation
☆19Updated last week
Alternatives and similar repositories for image-transcreation:
Users that are interested in image-transcreation are comparing it to the libraries listed below
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Updated 5 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- ☆28Updated 3 weeks ago
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Updated 6 months ago
- Measuring the Mixing of Contextual Information in the Transformer☆28Updated last year
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Updated last year
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- ☆49Updated 3 months ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆26Updated last year
- Python 3 support for the MS COCO caption evaluation tools☆14Updated 9 months ago
- ☆30Updated 9 months ago
- ☆33Updated last year
- ☆14Updated 3 years ago
- NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings☆55Updated 9 months ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆15Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆47Updated 3 weeks ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆44Updated 7 months ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Updated 9 months ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Updated last year
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Updated last year
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Updated last year
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆27Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 5 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 9 months ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 3 years ago
- ☆16Updated 3 weeks ago
- Data repository for the VALSE benchmark.☆37Updated last year