FeiElysia / ViECapLinks
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
☆161Updated last year
Alternatives and similar repositories for ViECap
Users that are interested in ViECap are comparing it to the libraries listed below
Sorting:
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated 2 years ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Updated 2 years ago
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas…☆103Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆77Updated 2 years ago
- Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)☆101Updated 7 months ago
- ☆90Updated 2 years ago
- Code release for Your “On-the-fly Category Discovery (CVPR 2023)”☆53Updated 2 years ago
- ☆89Updated last month
- Official implementation of BMVC2023 Oral paper: 《Describe Your Facial Expressions by Linking Image Encoders and Large Language Models》☆65Updated 4 months ago
- [ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"☆78Updated 3 years ago
- The implementaion of CoDT on the task of NTU-60+->PKUMMD☆72Updated 2 years ago
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆130Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆20Updated 2 years ago
- [ICCV 2023] Official implement of <Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disentanglement>☆72Updated last year
- Official implementation of "Self-slimmed Vision Transformer" (ECCV2022)☆72Updated 3 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆40Updated 7 months ago
- "Towards Semi-supervised Learning with Non-random Missing Labels" by Yue Duan (ICCV 2023)☆77Updated 3 weeks ago
- Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation(CVPR-2023)☆79Updated last year
- The code is for PBRnet for action detection☆73Updated 4 years ago
- [BMVC2023] Spatial and Planar Consistency for Semi-Supervised Volumetric Medical Image Segmentation☆76Updated last year
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Updated last year
- [IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment☆53Updated last year
- [CVPR-2023] Official Codes for "TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompte…☆95Updated last year
- [CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning☆122Updated 11 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆15Updated 3 weeks ago
- "MutexMatch: Semi-Supervised Learning with Mutex-Based Consistency Regularization" by Yue Duan (TNNLS)☆71Updated 3 weeks ago
- The official implement of DS2DP [TGRS 2022]☆62Updated 10 months ago
- a unified and simple codebase for weakly-supervised temporal action localization☆19Updated 2 years ago
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆143Updated last year
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆138Updated 5 months ago