GZU-SAMLab / CDKMLinks
Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning
☆29Updated last year
Alternatives and similar repositories for CDKM
Users that are interested in CDKM are comparing it to the libraries listed below
Sorting:
- Phenotype segmentation method based on spectral reconstruction for UAV field vegetation.☆27Updated last year
- We propose a text-guided image inpainting method with multi-grained image-text semantic learning (MISL), consisting of global-local gener…☆27Updated last year
- Meta-contrastive Learning with Support-based Query Interaction for Few-shot Fine-grained Visual Classification☆33Updated last year
- LCM-Captioner is an efficient model for Text-based Image Captioning(TextCap).☆26Updated 2 years ago
- Mutil-stage knowledge distillation (MSKD) can facilitate the accuracy of plant disease detection, which may be a new and vital direction …☆28Updated last year
- AA-trans: Core attention aggregating transformer with informationentropy selector for fine-grained visual classification☆34Updated last year
- Count-Supervised Network (CSNet) can complete the counting of wheat ears with only quantitative supervision. CSNet: A Count-supervised N…☆29Updated last year
- T3Bench: Benchmarking Current Progress in Text-to-3D Generation☆1,098Updated last year
- ☆1,075Updated last year
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆644Updated 3 weeks ago
- ☆27Updated 2 years ago
- ☆936Updated last year
- [IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive T…☆15Updated last month
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆14Updated last year
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆764Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆68Updated 3 weeks ago
- ☆17Updated 4 months ago
- [Neural Networks 2025]Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆10Updated 7 months ago
- ☆23Updated 10 months ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code)☆16Updated last year
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'☆25Updated last year
- XCurve is an end-to-end PyTorch library for X-Curve metrics optimizations in machine learning.☆142Updated last year
- Summary of Related Research on Image-Text Matching☆70Updated 2 years ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆244Updated 4 months ago
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing☆11Updated 8 months ago
- ☆19Updated last year
- ☆69Updated 4 months ago
- ☆16Updated last year
- ICCV 2025 论文和开源项目合集☆2,636Updated last month
- 中国海洋大学硕士博士学位论 文 LaTeX 模板(2025版)☆53Updated 5 months ago