usydnlp / VICTRLinks
This repository contains code for paper VICTR: Visual Information Captured Text Representation for Text-to-Image Multimodal Tasks
☆13Updated 3 years ago
Alternatives and similar repositories for VICTR
Users that are interested in VICTR are comparing it to the libraries listed below
Sorting:
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 3 years ago
- ☆22Updated 2 years ago
- ☆97Updated this week
- LeicaGAN-Pytorch☆35Updated 5 years ago
- ☆45Updated 3 years ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆141Updated 2 months ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆34Updated 2 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Updated 5 years ago
- ☆120Updated 2 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- ☆76Updated 2 years ago
- ☆73Updated 3 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆90Updated 3 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆39Updated 3 years ago
- ☆33Updated 3 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Updated 2 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Updated 2 years ago
- The SVO-Probes Dataset for Verb Understanding☆31Updated 3 years ago
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles.☆36Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago
- [ICML 2021] “ Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang☆63Updated 3 years ago
- Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO☆52Updated 4 years ago
- This repository hosts the dataset and source code for "A causal view of compositional zero-shot recognition". Yuval Atzmon, Felix Kreuk, …☆27Updated 4 years ago
- implementation of paper https://arxiv.org/abs/2210.04559☆54Updated 2 years ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆56Updated 2 years ago
- kdexd/coco-caption@de6f385☆26Updated 5 years ago
- ☆84Updated 2 years ago
- ☆55Updated 5 years ago
- ☆63Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆15Updated 4 years ago