Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
☆54 · Sep 3, 2020 · Updated 5 years ago
Alternatives and similar repositories for Crisscrossed-Captions
Users who are interested in Crisscrossed-Captions are comparing it to the libraries listed below.
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022) ☆56 · Mar 1, 2024 · Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021) ☆10 · May 1, 2025 · Updated last year
- Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021). ☆29 · Sep 4, 2021 · Updated 4 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples. ☆58 · Sep 12, 2019 · Updated 6 years ago
- ☆27 · Dec 15, 2022 · Updated 3 years ago
- ☆14 · Aug 30, 2022 · Updated 3 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 · Dec 7, 2022 · Updated 3 years ago
- Meta-BERT: Learning to Learn fast For Low-Resource Text Classification ☆19 · Oct 22, 2021 · Updated 4 years ago
- ☆98 · May 12, 2026 · Updated last week
- A large scale dataset for Video Captioning in Italian ☆13 · May 16, 2023 · Updated 3 years ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models ☆21 · Jan 11, 2024 · Updated 2 years ago
- Code for ViLBERTScore in EMNLP Eval4NLP ☆18 · Oct 27, 2022 · Updated 3 years ago
- Cross Modal Retrieval with Querybank Normalisation ☆57 · Nov 21, 2023 · Updated 2 years ago
- Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019 ☆41 · Sep 24, 2019 · Updated 6 years ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging" ☆25 · Oct 11, 2025 · Updated 7 months ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin… ☆10 · Jun 16, 2024 · Updated last year
- This repository contains the code for our ECCV 2022 paper on our "Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning". ☆12 · Dec 6, 2022 · Updated 3 years ago
- Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M… ☆74 · Dec 6, 2023 · Updated 2 years ago
- ☆82 · Nov 6, 2023 · Updated 2 years ago
- BISON: Binary Image SelectiON ☆49 · Sep 15, 2021 · Updated 4 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023. ☆56 · Mar 6, 2023 · Updated 3 years ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models" ☆11 · Jun 18, 2024 · Updated last year
- ☆131 · Aug 18, 2022 · Updated 3 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 · Apr 4, 2024 · Updated 2 years ago
- ☆13 · Sep 5, 2023 · Updated 2 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022 ☆269 · Oct 2, 2024 · Updated last year
- The implementation for "DEER: Descriptive Knowledge Graph for Explaining Entity Relationships" (EMNLP '22) ☆12 · Oct 31, 2022 · Updated 3 years ago
- Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language … ☆24 · Oct 19, 2022 · Updated 3 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT ☆72 · Nov 14, 2022 · Updated 3 years ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral) ☆165 · Aug 24, 2025 · Updated 8 months ago
- Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension ☆34 · Mar 8, 2018 · Updated 8 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment ☆21 · Apr 15, 2022 · Updated 4 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo ☆35 · Dec 8, 2022 · Updated 3 years ago
- Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023) ☆27 · Apr 24, 2023 · Updated 3 years ago
- incremental symbol learning for natural language understanding ☆10 · Jun 12, 2023 · Updated 2 years ago
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral) ☆87 · Apr 10, 2022 · Updated 4 years ago
- ☆26 · Mar 4, 2022 · Updated 4 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA ☆14 · Jun 1, 2022 · Updated 3 years ago
- Generate a denotation graph from a set of image captions ☆16 · Sep 4, 2018 · Updated 7 years ago