FeiElysia / ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
☆160, updated last year
Alternatives and similar repositories for ViECap
Users who are interested in ViECap are comparing it to the repositories listed below
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral ☆91, updated 2 years ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023 ☆80, updated 2 years ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering ☆77, updated 2 years ago
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas… ☆103, updated last year
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023) ☆100, updated 6 months ago
- ☆89, updated last year
- Code release for Your “On-the-fly Category Discovery (CVPR 2023)” ☆53, updated 2 years ago
- ☆88, updated 2 years ago
- Official implementation of BMVC2023 Oral paper: 《Describe Your Facial Expressions by Linking Image Encoders and Large Language Models》 ☆62, updated 3 months ago
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation ☆130, updated last year
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation" ☆77, updated 2 years ago
- "Towards Semi-supervised Learning with Non-random Missing Labels" by Yue Duan (ICCV 2023) ☆78, updated 11 months ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering ☆20, updated 2 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text … ☆15, updated 2 months ago
- ☆84, updated 5 months ago
- Official implementation of "Self-slimmed Vision Transformer" (ECCV2022) ☆72, updated 3 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)" ☆37, updated 6 months ago
- [ICCV 2023] Official implementation of "Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disentanglement" ☆71, updated last year
- [BMVC2023] Spatial and Planar Consistency for Semi-Supervised Volumetric Medical Image Segmentation ☆77, updated last year
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023) ☆66, updated last year
- The implementation of CoDT on the task of NTU-60+->PKUMMD ☆72, updated 2 years ago
- ☆62, updated last year
- [CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning ☆121, updated 10 months ago
- Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation (CVPR-2023) ☆79, updated last year
- [CVPR-2023] Official Codes for "TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompte… ☆93, updated last year
- The official implementation of DS2DP [TGRS 2022] ☆62, updated 8 months ago
- "MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization" by Yue Duan (TNNLS) ☆71, updated 11 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral] ☆55, updated 5 months ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition ☆10, updated 2 months ago
- ☆30, updated 2 years ago