EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.
☆67Jan 27, 2026Updated last month
Alternatives and similar repositories for spot-the-diff
Users that are interested in spot-the-diff are comparing it to the libraries listed below
Sorting:
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)☆50Dec 8, 2022Updated 3 years ago
- Harsh Jhamtani*, Varun Gangal*, Eduard Hovy, Graham Neubig, Taylor Berg-Kirkpatrick. Learning to Generate Move-by-Move Commentary for Che…☆45Jul 21, 2022Updated 3 years ago
- Demos of neural image editing☆11Mar 15, 2021Updated 4 years ago
- ☆14Aug 5, 2018Updated 7 years ago
- "Learning Rhyming Constraints using Structured Adversaries. Jhamtani H., Mehta S., Carbonell J., Berg-Kirkpatrick T. EMNLP-IJCNLP (Short …☆11Mar 17, 2020Updated 5 years ago
- Amazon - Understanding Product Images☆14May 7, 2021Updated 4 years ago
- ☆13Jan 29, 2024Updated 2 years ago
- [BMVC2024] Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning☆14Feb 14, 2026Updated 2 weeks ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆19Mar 3, 2025Updated 11 months ago
- code for running trained model from Visual Reasoning by Progressive Module Networks (ICLR19)☆15Jan 30, 2019Updated 7 years ago
- A length-controllable and non-autoregressive image captioning model.☆69Jun 10, 2021Updated 4 years ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆19Dec 16, 2024Updated last year
- Pytorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)☆24May 11, 2025Updated 9 months ago
- Super fast implementations of common benchmark text world games☆52Aug 25, 2025Updated 6 months ago
- Toward Scalable Neural Dialogue State Tracking Model☆20Sep 23, 2022Updated 3 years ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆28Jan 26, 2025Updated last year
- Brown clustering in Python☆22Dec 12, 2017Updated 8 years ago
- This repository contains the nine hundred sky segmentation datasets and the sky segmentation model provided by us.☆23Jun 21, 2022Updated 3 years ago
- Official implementation of "Top Batch DropBlock for Person Re-Identification" ICPR 2020☆27Mar 28, 2021Updated 4 years ago
- [IEEE GRSL 2022 🔥] "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"☆32Jun 20, 2023Updated 2 years ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆33Jun 30, 2025Updated 8 months ago
- GPT-3 Interactive CLI built on ai dungeon☆40Aug 26, 2020Updated 5 years ago
- Joint detection of Object and its Semantic parts using Attention-based Feature Fusion on PASCAL Parts 2010 dataset☆27Jul 25, 2024Updated last year
- The implementation of our paper: Bilinear Representation for Language-Based Image Editing using Conditional Generative Adversarial Networ…☆25Feb 1, 2022Updated 4 years ago
- A library of techniques for local interpretation of machine learning models☆10Mar 24, 2023Updated 2 years ago
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Mar 27, 2021Updated 4 years ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆46Updated this week
- Attentive Semantic Video Generation using Captions☆36Oct 22, 2017Updated 8 years ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆161Sep 27, 2025Updated 5 months ago
- Dataset created for the Power Line Insulators Inspection Detections☆10Jul 2, 2020Updated 5 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Data repository for the VALSE benchmark.☆37Feb 15, 2024Updated 2 years ago
- [IEEE TGRS 2024 🔥] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis☆177Jul 27, 2025Updated 7 months ago
- [NeurIPS 2023] Generalized Logit Adjustment☆39Apr 21, 2024Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- Code for the ICME 2021 paper "SAFIN: Arbitrary Style Transfer With Self-Attentive Factorized Instance Normalization"☆34Jan 15, 2024Updated 2 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆83Jun 20, 2023Updated 2 years ago
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a…☆85Sep 27, 2022Updated 3 years ago
- ☆13Nov 9, 2025Updated 3 months ago