Code and data for ImageCoDe, a contextual vision-and-language benchmark
☆41 · Mar 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for imagecode
Users who are interested in imagecode are comparing it to the repositories listed below.
- Official repository for Fourier model that can generate periodic signals ☆10 · Mar 10, 2022 · Updated 4 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?" ☆33 · Mar 15, 2024 · Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 · Jul 27, 2023 · Updated 2 years ago
- Emergent Communication Pretraining for Few-Shot Machine Translation ☆13 · Dec 3, 2020 · Updated 5 years ago
- Implementation for the paper "Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly" (ECCV 2022: https://arxiv.org/abs… ☆39 · May 19, 2023 · Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La… ☆114 · Mar 24, 2022 · Updated 3 years ago
- ☆10 · Aug 22, 2022 · Updated 3 years ago
- This is an official implementation of GRIT-VLP ☆20 · Aug 8, 2022 · Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Sep 17, 2021 · Updated 4 years ago
- ☆54 · Jul 31, 2022 · Updated 3 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ☆12 · Aug 23, 2025 · Updated 6 months ago
- ☆29 · Oct 18, 2022 · Updated 3 years ago
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron` ☆12 · Jun 22, 2022 · Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · Nov 23, 2024 · Updated last year
- [ACL 2023] Delving into the Openness of CLIP ☆24 · Jan 11, 2023 · Updated 3 years ago
- SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations ☆15 · Jul 27, 2024 · Updated last year
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022) ☆56 · Mar 1, 2024 · Updated 2 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021) ☆20 · Dec 4, 2021 · Updated 4 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders. ☆14 · Jun 7, 2023 · Updated 2 years ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning" ☆12 · Oct 19, 2021 · Updated 4 years ago
- The official repository for the paper entitled "Time Travel in LLMs: Tracing Data Contamination in Large Language Models." ☆12 · Jun 11, 2024 · Updated last year
- Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering dataset ☆56 · Nov 15, 2023 · Updated 2 years ago
- Curated list of awesome ML Visualization Libraries ☆13 · Jun 23, 2023 · Updated 2 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022 ☆268 · Oct 2, 2024 · Updated last year
- 3D Medical Image Retrieval in Radiology ☆18 · Dec 19, 2024 · Updated last year
- The project is about predicting sets (of classes) from images. ☆23 · Aug 31, 2021 · Updated 4 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning ☆21 · Feb 14, 2023 · Updated 3 years ago
- Data repository for the VALSE benchmark. ☆37 · Feb 15, 2024 · Updated 2 years ago
- Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity" ☆61 · Jun 12, 2023 · Updated 2 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?" ☆11 · Jul 22, 2023 · Updated 2 years ago
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels. ☆77 · Aug 14, 2022 · Updated 3 years ago
- A general purpose web app for connecting participants to engage in realtime conversations based on generated prompts. ☆20 · Jun 21, 2023 · Updated 2 years ago
- ☆22 · Dec 1, 2021 · Updated 4 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives" ☆46 · Dec 1, 2024 · Updated last year
- ☆15 · Nov 30, 2023 · Updated 2 years ago
- [ICCV 2023 & IJCV 2026] PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection ☆22 · Aug 12, 2024 · Updated last year
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆53 · Jun 12, 2023 · Updated 2 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da… ☆15 · Mar 28, 2025 · Updated 11 months ago
- This repository is used to publish our codes for the conference paper "Vietnamese punctuation prediction using deep neural networks" at S… ☆11 · Jul 11, 2020 · Updated 5 years ago