GZU-SAMLab / McQNet
Meta-contrastive Learning with Support-based Query Interaction for Few-shot Fine-grained Visual Classification
☆33 · Updated 2 years ago
Alternatives and similar repositories for McQNet
Users that are interested in McQNet are comparing it to the libraries listed below
- We propose a text-guided image inpainting method with multi-grained image-text semantic learning (MISL), consisting of global-local gener… ☆27 · Updated 2 years ago
- LCM-Captioner is an efficient model for Text-based Image Captioning (TextCap). ☆26 · Updated 2 years ago
- Multi-stage knowledge distillation (MSKD) can improve the accuracy of plant disease detection, which may be a new and vital direction … ☆28 · Updated 2 years ago
- Phenotype segmentation method based on spectral reconstruction for UAV field vegetation. ☆28 · Updated 2 years ago
- AA-trans: Core attention aggregating transformer with information entropy selector for fine-grained visual classification ☆37 · Updated 2 years ago
- Count-Supervised Network (CSNet) can complete the counting of wheat ears with only quantitative supervision. CSNet: A Count-supervised N… ☆32 · Updated last year
- Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning ☆29 · Updated 2 years ago
- T3Bench: Benchmarking Current Progress in Text-to-3D Generation ☆1,100 · Updated 2 years ago
- ☆1,129 · Updated last year
- [Journal of Image and Graphics & ChinaMM2025] Decision-fusion strategy for multimodal object detection without spatial registration ☆39 · Updated 6 months ago
- A curated publication list on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation). ☆827 · Updated 2 weeks ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning". ☆802 · Updated 2 years ago
- ☆938 · Updated 2 years ago
- This repository provides our TUT dataset. ☆22 · Updated last year
- [Official Repo] Visual Mamba: A Survey and New Outlooks ☆731 · Updated 11 months ago
- ☆14 · Updated last year
- [IJCV 2024] Official code for "Towards Task Sampler Learning for Meta-Learning" ☆12 · Updated 3 months ago
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning ☆24 · Updated last month
- ☆80 · Updated 3 months ago
- Official implementation of the AAAI 2024 paper 'Gramformer: Learning Crowd Counting via Graph-Modulated Transformer' ☆23 · Updated last year
- [Neural Networks 2025] Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval ☆11 · Updated last year
- Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP" ☆12 · Updated 10 months ago
- VMamba: Visual State Space Models; code is based on Mamba ☆3,039 · Updated 11 months ago
- (TPAMI 2024) A Survey on Open Vocabulary Learning ☆986 · Updated last month
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP. ☆749 · Updated 2 months ago
- ☆257 · Updated 2 years ago
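- [MM2024] Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition ☆13 · Updated last year
- This repository contains the ViT code and the dataset portion. ☆133 · Updated last year
- The most complete collection of plug-and-play modules on the web for 2025, shared for free! CVPR2025, AAAI2025, ICLR2025, TNNLS2025, arXiv2025... Covers all areas of artificial intelligence (machine learning, deep learning, etc.), applicable to image classification, object detection, instance segmentation, semantic segmentation, panoptic segmentation, pose recognition, medical image segmentation, video… ☆1,414 · Updated 8 months ago
- ☆18 · Updated last year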