CuthbertCai/Ask-Confirm

Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)

☆20

Alternatives and similar repositories for Ask-Confirm

Users who are interested in Ask-Confirm are comparing it to the repositories listed below.

uvavision / DrillDown
[NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
☆12 · Apr 15, 2022 · Updated 4 years ago
XLearning-SCU / 2021-CVPR-MRL
Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)
☆13 · Apr 7, 2021 · Updated 5 years ago
rxtan2 / video-grounding-narrations
☆12 · Mar 12, 2023 · Updated 3 years ago
showlab / Region_Learner
The PyTorch implementation for "Video-Text Pre-training with Learned Regions"
☆43 · Jul 15, 2022 · Updated 4 years ago
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25 · May 14, 2026 · Updated 2 months ago
zhangy0822 / USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33 · Jun 18, 2025 · Updated last year
naver-ai / eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
☆56 · Jul 7, 2026 · Updated 2 weeks ago
showlab / DemoVLP
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22 · Mar 19, 2022 · Updated 4 years ago
facebookresearch / stepdiff
Data release for Step Differences in Instructional Video (CVPR24)
☆15 · Jun 19, 2024 · Updated 2 years ago
ReID-Team / ReID_extra_testdata
☆12 · Dec 24, 2020 · Updated 5 years ago
TencentYoutuResearch / SelfSupervisedLearning-DSM
Code for AAAI21 paper "Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion"
☆28 · Jan 7, 2021 · Updated 5 years ago
KT27-A / Optical_Flow_GPU_Opencv3
Extracting optical flow based on GPU in Opencv3
☆12 · Jul 29, 2019 · Updated 6 years ago
CSU-JPG / Awesome-VLM-Reasoning
☆21 · May 19, 2025 · Updated last year
sunnychencool / AOQ
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34 · Jul 2, 2020 · Updated 6 years ago
StanLei52 / TQVSR
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆24 · Sep 11, 2023 · Updated 2 years ago
TencentYoutuResearch / PersonReID-NAFS
Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"
☆63 · Apr 16, 2021 · Updated 5 years ago
ecom-research / ComposeAE
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
☆55 · Oct 8, 2021 · Updated 4 years ago
iLearn-Lab / SIGIR21-DIME
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆69 · Apr 5, 2026 · Updated 3 months ago
yahoo / maaf
Modality-Agnostic Attention Fusion for visual search with text feedback
☆25 · Mar 21, 2023 · Updated 3 years ago
CrossmodalGroup / BFAN
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
☆39 · Jun 19, 2023 · Updated 3 years ago
ShuaiBai623 / AIC2021-T5-CLV
🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)
☆94 · Apr 28, 2021 · Updated 5 years ago
facebookresearch / genecis
Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity"
☆61 · Jun 12, 2023 · Updated 3 years ago
UCSB-AI / ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆37 · Aug 18, 2024 · Updated last year
TencentYoutuResearch / Ensemble-Grafting
Code for CVPR 2020 paper “Filter Grafting for Deep Neural Networks”
☆14 · Dec 21, 2020 · Updated 5 years ago
weixmath / CVR
☆18 · Jan 4, 2024 · Updated 2 years ago
frostinassiky / bsp
Placeholder for code of BSP.
☆11 · Aug 13, 2021 · Updated 4 years ago
youngkyunJang / VDG
Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024
☆21 · May 30, 2024 · Updated 2 years ago
GT-RIPL / Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆61 · Oct 21, 2022 · Updated 3 years ago
penghu-cs / MRL
Learning Cross-Modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)
☆56 · Mar 5, 2023 · Updated 3 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18 · Sep 17, 2021 · Updated 4 years ago
cychomatica / One-Pixel-Shotcut
One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)
☆14 · Sep 28, 2025 · Updated 9 months ago
dayu11 / Availability-Attacks-Create-Shortcuts
☆10 · Jul 28, 2022 · Updated 3 years ago
TencentYoutuResearch / Pruning-PFF
PyTorch implementation of NeurIPS 2020 paper "Pruning Filter in Filter".
☆18 · Jan 4, 2021 · Updated 5 years ago
Cuberick-Orion / CIRPLANT
Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…
☆39 · Jun 26, 2024 · Updated 2 years ago
leonnnop / VAR
[CVPR 2022] Visual Abductive Reasoning
☆124 · Oct 22, 2024 · Updated last year
cdluminate / ladderloss
Ladder Loss for Coherent Visual-Semantic Embedding, AAAI, 2020
☆13 · Aug 14, 2021 · Updated 4 years ago
liuzrcc / ImageShortcutSqueezing
Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression
☆14 · Mar 22, 2025 · Updated last year
Andy-Cheng / TEMPURA
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…
☆27 · Jun 4, 2025 · Updated last year
Paranioar / Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆446 · Sep 25, 2025 · Updated 9 months ago