[NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
☆12Apr 15, 2022Updated 3 years ago
Alternatives and similar repositories for DrillDown
Users that are interested in DrillDown are comparing it to the libraries listed below
Sorting:
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 2 years ago
- PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."☆14Apr 20, 2019Updated 6 years ago
- ☆28Nov 22, 2022Updated 3 years ago
- Modality-Agnostic Attention Fusion for visual search with text feedback☆25Mar 21, 2023Updated 2 years ago
- ☆24May 31, 2022Updated 3 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Jun 18, 2025Updated 8 months ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- ☆11Nov 30, 2022Updated 3 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"☆30Jan 8, 2019Updated 7 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- MAC: Mining Activity Concepts for Language-based Temporal Localization☆36Nov 26, 2018Updated 7 years ago
- Official code for "CSCE-Net: Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding"☆12Jun 1, 2023Updated 2 years ago
- This is a pytorch implement of SCDA (selective deep descriptors aggregation for fine-grained image retrieval), which is fully translated …☆20Mar 8, 2022Updated 4 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- Code implementation of our BMVC 2022 paper: Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition☆11Dec 18, 2022Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- PyTorch Implementation of TCN (T-PAMI2021)☆10Oct 24, 2021Updated 4 years ago
- Load and visualize different datasets in video question answering☆10May 11, 2021Updated 4 years ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Jan 30, 2021Updated 5 years ago
- PyTorch implementation for "Gated Transfer Network for Transfer Learning"☆11Jun 3, 2019Updated 6 years ago
- ☆11Aug 16, 2019Updated 6 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- ☆164Mar 7, 2022Updated 4 years ago
- A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"☆95Sep 21, 2019Updated 6 years ago
- Implementations of some few-shot action recognition methods.☆43Jun 7, 2021Updated 4 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)