reallsp / SAF
☆11Updated last year
Alternatives and similar repositories for SAF
Users that are interested in SAF are comparing it to the libraries listed below
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆21Updated 2 months ago
- ☆34Updated 2 years ago
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆119Updated 2 years ago
- Code of SSAN☆66Updated last year
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆246Updated 5 months ago
- https://layer6ai-labs.github.io/xpool/☆125Updated 2 years ago
- ☆30Updated 2 years ago
- 【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search☆63Updated 2 years ago
- The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"☆162Updated last month
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search☆67Updated last year
- ☆21Updated 2 years ago
- ☆30Updated last year
- [ECCV 2022] A PyTorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval☆77Updated 2 years ago
- Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)☆75Updated last year
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity☆66Updated 11 months ago
- Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"☆62Updated 4 years ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆52Updated last year
- [BMVC 2021] Text-Based Person Search with Limited Data☆45Updated 3 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆37Updated 3 months ago
- Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)☆160Updated last week
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆95Updated 2 years ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval☆29Updated 3 years ago
- ☆49Updated last year
- SeqTR: A Simple yet Universal Network for Visual Grounding☆142Updated 10 months ago
- PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Updated last year
- [CVPR 2022] This repository is for the paper "DIFNet: Boosting Visual Information Flow for Image Captioning".☆20Updated 2 years ago
- ☆35Updated 4 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆123Updated 2 years ago
- Summary of Related Research on Image-Text Matching☆71Updated 2 years ago
- The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"☆120Updated 8 months ago