GZU-SAMLab / CDKMLinks
Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning
☆29Updated 2 years ago
Alternatives and similar repositories for CDKM
Users that are interested in CDKM are comparing it to the libraries listed below
Sorting:
- LCM-Captioner is an efficient model for Text-based Image Captioning(TextCap).☆26Updated 2 years ago
- Meta-contrastive Learning with Support-based Query Interaction for Few-shot Fine-grained Visual Classification☆33Updated 2 years ago
- Phenotype segmentation method based on spectral reconstruction for UAV field vegetation.☆27Updated 2 years ago
- We propose a text-guided image inpainting method with multi-grained image-text semantic learning (MISL), consisting of global-local gener…☆27Updated 2 years ago
- Mutil-stage knowledge distillation (MSKD) can facilitate the accuracy of plant disease detection, which may be a new and vital direction …☆28Updated 2 years ago
- AA-trans: Core attention aggregating transformer with informationentropy selector for fine-grained visual classification☆36Updated 2 years ago
- Count-Supervised Network (CSNet) can complete the counting of wheat ears with only quantitative supervision. CSNet: A Count-supervised N…☆31Updated last year
- ☆1,104Updated last year
- Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification☆12Updated last year
- [NeurIPS 2024 spotlight] Offical implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR…☆625Updated 5 months ago
- The Pytorch implemetation of "FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging", AAAI 2024.☆11Updated last year
- Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning (CVPR 2025, pytorch co…☆12Updated 3 weeks ago
- Multi-view dual attention network for 3D object recognition (Neural Computing and Applications, 2021)☆12Updated 3 years ago
- ☆10Updated last year
- About [MM2024] Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition☆12Updated 11 months ago
- ☆112Updated last year
- Datasets for Multi-view Learning☆53Updated 6 months ago
- ICCV 2025 论文和开源项目合集☆2,733Updated 3 months ago
- ☆14Updated 4 months ago
- ☆14Updated last year
- ☆251Updated last year
- [Neural Networks 2025]Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆11Updated 9 months ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆246Updated 6 months ago
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆707Updated 8 months ago
- Code release for Bi-Directional Feature Reconstruction Network for Fine-grained Few-shot Image Classification☆63Updated 2 years ago
- 异常/缺陷检测方向,谷歌学术和arxiv论文的相关更新☆37Updated last week
- The implementation of cvpr 2023 paper "Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternat…☆53Updated last year
- ☆40Updated 8 months ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆100Updated 10 months ago
- [ACM MM 2024] Pytorch Code for the paper "Robust Variational Contrastive Learning for Partially View-unaligned Clustering"☆12Updated 9 months ago