GZU-SAMLab / CDKM
Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning
☆29Updated last year
Alternatives and similar repositories for CDKM
Users who are interested in CDKM are comparing it to the repositories listed below
- Phenotype segmentation method based on spectral reconstruction for UAV field vegetation.☆26Updated last year
- LCM-Captioner is an efficient model for Text-based Image Captioning (TextCap).☆26Updated 2 years ago
- We propose a text-guided image inpainting method with multi-grained image-text semantic learning (MISL), consisting of global-local gener…☆27Updated last year
- Meta-contrastive Learning with Support-based Query Interaction for Few-shot Fine-grained Visual Classification☆33Updated last year
- Multi-stage knowledge distillation (MSKD) can improve the accuracy of plant disease detection, which may be a new and vital direction …☆28Updated last year
- AA-trans: Core attention aggregating transformer with information entropy selector for fine-grained visual classification☆34Updated last year
- Count-Supervised Network (CSNet) can complete the counting of wheat ears with only quantitative supervision. CSNet: A Count-supervised N…☆29Updated last year
- T3Bench: Benchmarking Current Progress in Text-to-3D Generation☆1,100Updated last year
- ☆1,064Updated last year
- This is an official PyTorch implementation of ASDA (accepted by ACMMM 2024).☆23Updated 9 months ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆19Updated 2 years ago
- XCurve is an end-to-end PyTorch library for X-Curve metrics optimizations in machine learning.☆143Updated last year
- vHeat: Building Vision Models upon Heat Conduction☆236Updated last month
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆14Updated last year
- ☆937Updated last year
- An official codebase of Scene-Aware Label Graph Learning for Multi-Label Image Classification, ICCV 2023.☆15Updated last year
- Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023☆157Updated 10 months ago
- ☆111Updated last year
- ☆12Updated 2 years ago
- ☆252Updated last year
- [CVPR 2023] Official implementation for "CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion."☆535Updated 6 months ago
- A paper list of some recent Computer Vision (CV) works☆525Updated this week
- Contains the ViT code and the dataset portion.☆129Updated last year
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆25Updated 2 months ago
- The official code for [Heterogeneity-Aware Federated Deep Multi-View Clustering towards Diverse Feature Representations] (ACM MM 24)☆12Updated 8 months ago
- ☆11Updated 9 months ago
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆689Updated 5 months ago
- A Simple Java Implementation of a Forum, based on Springboot and thymeleaf.☆15Updated last year
- ☆103Updated 7 months ago
- A curated publication list on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).☆679Updated 3 months ago