zchoi / PKOL
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
☆46Updated last year
Alternatives and similar repositories for PKOL:
Users that are interested in PKOL are comparing it to the libraries listed below
- [IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”☆83Updated 8 months ago
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆61Updated 2 years ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Updated 8 months ago
- ☆11Updated last year
- The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…☆19Updated last year
- ☆19Updated 9 months ago
- ☆57Updated last month
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆17Updated 2 months ago
- Paper Reading of IMCC groups.☆18Updated 2 weeks ago
- Word4Per is an innovative framework for Zero-Shot Composed Person Retrieval (ZS-CPR), integrating visual and textual information for enha…☆23Updated 4 months ago
- AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval☆19Updated 8 months ago
- ☆91Updated last year
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆48Updated last year
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆22Updated 11 months ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆35Updated 7 months ago
- ☆74Updated last year
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆100Updated 5 months ago
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆152Updated 8 months ago
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆14Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆62Updated last month
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆51Updated 5 months ago
- ☆21Updated 2 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆11Updated 4 months ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆18Updated last year
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated last year
- ☆17Updated 2 years ago
- Instruction Tuning in Continual Learning paradigm☆47Updated 3 months ago
- [NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models☆35Updated 6 months ago
- 中科大跨模态智能组-每周论文分享☆16Updated 2 years ago
- ☆17Updated 5 months ago