FeiElysia / ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
☆160Updated last year
Alternatives and similar repositories for ViECap
Users that are interested in ViECap are comparing it to the libraries listed below
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Updated last year
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas…☆102Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆76Updated 2 years ago
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)☆100Updated 5 months ago
- Code release for “On-the-fly Category Discovery” (CVPR 2023)☆53Updated 2 years ago
- ☆87Updated 2 years ago
- ☆88Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆19Updated 2 years ago
- Official implementation of BMVC2023 Oral paper: 《Describe Your Facial Expressions by Linking Image Encoders and Large Language Models》☆62Updated 2 months ago
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"☆77Updated 2 years ago
- ☆84Updated 4 months ago
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆129Updated last year
- The implementation of CoDT on the task of NTU-60+->PKUMMD☆72Updated 2 years ago
- "Towards Semi-supervised Learning with Non-random Missing Labels" by Yue Duan (ICCV 2023)☆77Updated 10 months ago
- [ICCV 2023] Official implementation of <Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disentanglement>☆71Updated last year
- Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation(CVPR-2023)☆79Updated last year
- The code is for PBRnet for action detection☆73Updated 4 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆15Updated last month
- Panoptic Scene Graph Biased Annotation☆35Updated last year
- a unified and simple codebase for weakly-supervised temporal action localization☆19Updated 2 years ago
- Official implementation of "Self-slimmed Vision Transformer" (ECCV2022)☆72Updated 3 years ago
- The official implementation of DS2DP [TGRS 2022]☆62Updated 7 months ago
- [BMVC2023] Spatial and Planar Consistency for Semi-Supervised Volumetric Medical Image Segmentation☆77Updated last year
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆137Updated 3 months ago
- "MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization" by Yue Duan (TNNLS)☆71Updated 10 months ago
- Accepted by ICME 2023 (Oral, CCF B)☆61Updated 2 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆33Updated 5 months ago
- [IJCV-2023] Official Codes for "Hierarchical Skeleton Meta-Prototype Contrastive Learning with Hard Skeleton Mining for Unsupervised Pers…☆63Updated last year
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆140Updated last year