Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retrieval (Lerner et al., ECIR'24)
☆38Dec 19, 2024Updated last year
Alternatives and similar repositories for ViQuAE
Users that are interested in ViQuAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jul 23, 2021Updated 4 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)☆87Apr 10, 2022Updated 3 years ago
- ☆43Aug 15, 2023Updated 2 years ago
- ☆30Dec 16, 2022Updated 3 years ago
- ☆40Aug 4, 2025Updated 7 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆63Mar 14, 2026Updated last week
- ☆13Mar 25, 2023Updated 3 years ago
- Multimodal entity linking for Tweets☆29Aug 30, 2021Updated 4 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.☆23Nov 9, 2025Updated 4 months ago
- ☆14May 10, 2021Updated 4 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- ☆22Jun 13, 2024Updated last year
- Danmuku dataset☆11Jul 7, 2023Updated 2 years ago
- visual question answering prompting recipes for large vision-language models☆28Sep 14, 2024Updated last year
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- ☆63Jan 3, 2025Updated last year
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Feb 9, 2024Updated 2 years ago
- Pytorch implementation of "Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph", ACL 2022☆15Feb 28, 2022Updated 4 years ago
- ☆40Nov 23, 2022Updated 3 years ago
- Better Live Text for MacOS☆33Feb 8, 2026Updated last month
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 3 years ago
- ☆15Dec 22, 2021Updated 4 years ago
- ☆16May 6, 2021Updated 4 years ago
- ☆152Oct 12, 2022Updated 3 years ago
- Data-centric AI building blocks for computer vision applications☆57Updated this week
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'☆28Dec 3, 2023Updated 2 years ago
- ☆45Aug 14, 2023Updated 2 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆33Dec 9, 2025Updated 3 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- ☆39Feb 28, 2023Updated 3 years ago
- A simple, well-documented, pedagogical deep learning framework implemented entirely in Python☆12Sep 27, 2020Updated 5 years ago
- Dataset and starting code for visual entailment dataset☆119Apr 21, 2022Updated 3 years ago
- 本科毕业论文、源码及相关材料☆15Dec 30, 2019Updated 6 years ago
- Yet Another SEquence Tagger☆10Dec 8, 2022Updated 3 years ago
- [CVPR 2021] This repository is the official implementation of "PML: Progressive Margin Loss for Long-tailed Age Classification."☆17Mar 13, 2024Updated 2 years ago