[ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
☆57 · Aug 13, 2024 · Updated last year
Alternatives and similar repositories for LMPT
Users who are interested in LMPT compare it to the repositories listed below.
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding ☆44 · Nov 30, 2024 · Updated last year
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models ☆77 · Dec 4, 2024 · Updated last year
- Code for Chinese grammatical error correction based on knowledge distillation ☆11 · Aug 16, 2022 · Updated 3 years ago
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models ☆97 · Dec 13, 2024 · Updated last year
- ☆23 · Jan 12, 2024 · Updated 2 years ago
- This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and cha… ☆21 · Aug 26, 2022 · Updated 3 years ago
- ☆27 · Jan 25, 2024 · Updated 2 years ago
- [MedIA'25] FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding. ☆170 · Nov 27, 2025 · Updated 3 months ago
- ☆12 · May 19, 2023 · Updated 2 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch ☆16 · Apr 22, 2019 · Updated 6 years ago
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition ☆69 · Oct 1, 2022 · Updated 3 years ago
- ☆17 · Jun 15, 2022 · Updated 3 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC… ☆18 · Feb 12, 2025 · Updated last year
- ☆19 · Dec 19, 2025 · Updated 2 months ago
- ☆20 · May 3, 2025 · Updated 10 months ago
- ICLR 2023 and ICML 2023 paper ☆23 · Sep 16, 2024 · Updated last year
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions ☆50 · May 26, 2023 · Updated 2 years ago
- [ICCV'23 Oral] Novel Scenes & Classes: Towards Adaptive Open-set Object Detection ☆47 · May 23, 2025 · Updated 9 months ago
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing ☆27 · Jul 4, 2023 · Updated 2 years ago
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024) ☆19 · Jul 15, 2024 · Updated last year
- ☆95 · Sep 23, 2023 · Updated 2 years ago
- Interactive Multi-Label CNN Learning with Partial Labels @ CVPR20 ☆22 · Dec 21, 2021 · Updated 4 years ago
- [AAAI 2023] Zero-Shot Enhancement of CLIP with Parameter-free Attention ☆93 · Apr 29, 2023 · Updated 2 years ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. ☆20 · May 7, 2022 · Updated 3 years ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding" ☆61 · Jul 5, 2025 · Updated 8 months ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets ☆60 · Jun 4, 2023 · Updated 2 years ago
- Implementing ONNX runtime for android to run Segment Anything Model 2 ☆12 · Aug 1, 2025 · Updated 7 months ago
- SUPERVAIZER is a toolkit built for the age of AI interoperability. At its core, it implements Google's Agent-to-Agent (A2A) protocol, ena… ☆14 · Feb 4, 2026 · Updated last month
- PyTorch implementation of Boosting Multi-Label Image Classification with Complementary Parallel Self-Distillation, IJCAI 2022. ☆26 · Aug 25, 2022 · Updated 3 years ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models ☆35 · Nov 3, 2024 · Updated last year
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos ☆28 · Dec 8, 2023 · Updated 2 years ago
- ☆30 · Mar 2, 2023 · Updated 3 years ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆183 · Mar 4, 2024 · Updated 2 years ago
- ☆200 · May 10, 2023 · Updated 2 years ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022) ☆71 · Oct 24, 2023 · Updated 2 years ago
- Source code for the paper "Structured Attention Graphs for Understanding Deep Image Classifications" ☆31 · Dec 18, 2021 · Updated 4 years ago
- A collection of resources on applications of multi-modal learning in medical imaging. ☆924 · Feb 8, 2026 · Updated last month
- [CVPR2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training" ☆151 · Jun 7, 2023 · Updated 2 years ago
- ☆11 · Mar 11, 2024 · Updated last year