GZU-SAMLab / LCM-CaptionerLinks
LCM-Captioner is an efficient model for Text-based Image Captioning(TextCap).
☆26Updated 2 years ago
Alternatives and similar repositories for LCM-Captioner
Users that are interested in LCM-Captioner are comparing it to the libraries listed below
Sorting:
- We propose a text-guided image inpainting method with multi-grained image-text semantic learning (MISL), consisting of global-local gener…☆27Updated last year
- Meta-contrastive Learning with Support-based Query Interaction for Few-shot Fine-grained Visual Classification☆33Updated last year
- Phenotype segmentation method based on spectral reconstruction for UAV field vegetation.☆27Updated last year
- Mutil-stage knowledge distillation (MSKD) can facilitate the accuracy of plant disease detection, which may be a new and vital direction …☆28Updated last year
- Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning☆29Updated last year
- Count-Supervised Network (CSNet) can complete the counting of wheat ears with only quantitative supervision. CSNet: A Count-supervised N…☆29Updated last year
- AA-trans: Core attention aggregating transformer with informationentropy selector for fine-grained visual classification☆34Updated last year
- T3Bench: Benchmarking Current Progress in Text-to-3D Generation☆1,098Updated last year
- ☆1,075Updated last year
- ☆936Updated last year
- [Neural Networks 2025]Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆10Updated 7 months ago
- ☆17Updated 4 months ago
- ICCV 2025 论文和开源项目合集☆2,636Updated last month
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆644Updated 3 weeks ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆764Updated 2 years ago
- A collection of papers about Referring Image Segmentation.☆746Updated this week
- ☆12Updated 8 months ago
- ☆27Updated 4 months ago
- (TPAMI 2024) A Survey on Open Vocabulary Learning☆946Updated 4 months ago
- Multimodal Sentiment Analysis with Image-Text Interaction Network☆14Updated last year
- XCurve is an end-to-end PyTorch library for X-Curve metrics optimizations in machine learning.☆142Updated last year
- ☆16Updated last year
- ☆69Updated 4 months ago
- A curated paper list of awesome skeleton-based action recognition.☆549Updated last week
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119☆1,145Updated last year
- ☆1,830Updated last year
- A curasted list of papers with the topic of Diffusion Models for Multi-Modal☆29Updated last year
- This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting…☆13Updated last month
- [IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive T…☆15Updated last month
- 2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频…☆1,043Updated 2 months ago