Code and data for ImageCoDe, a contextual vison-and-language benchmark
☆41Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for imagecode
Users that are interested in imagecode are comparing it to the libraries listed below
Sorting:
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 3 years ago
- This is an official implementation of GRIT-VLP☆20Aug 8, 2022Updated 3 years ago
- Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…☆38May 19, 2023Updated 2 years ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Mar 1, 2024Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"☆12Oct 19, 2021Updated 4 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 6 months ago
- ☆29Oct 18, 2022Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆54Jul 31, 2022Updated 3 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated last year
- This repository shows how to efficiently process variable-length sequences in TensorFlow.☆14Apr 26, 2022Updated 3 years ago
- A collection of functions to help you easily train and run Tensorflow Keras. It includes 1-line auto-TPU support, GPU memory management, …☆12Jul 6, 2022Updated 3 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- Curated list of awesome ML Visualization Libraries☆13Jun 23, 2023Updated 2 years ago
- ☆15Nov 30, 2023Updated 2 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- ☆20Apr 23, 2024Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- 3D Medical Image Retrieval in Radiology☆18Dec 19, 2024Updated last year
- A general purpose web app for connecting participants to engage in realtime conversations based on generated prompts.☆20Jun 21, 2023Updated 2 years ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆20Feb 14, 2025Updated last year
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning☆21Feb 14, 2023Updated 3 years ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆114Mar 24, 2022Updated 3 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 2 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Aug 12, 2024Updated last year
- [ACL 2023] Delving into the Openness of CLIP☆24Jan 11, 2023Updated 3 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆52Jun 16, 2025Updated 8 months ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆53Jun 12, 2023Updated 2 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21May 16, 2023Updated 2 years ago
- Paper Today I Read☆27Jan 27, 2026Updated last month
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆92Apr 16, 2024Updated last year