richard-peng-xia / LMPTView external linksLinks
[ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
☆57Aug 13, 2024Updated last year
Alternatives and similar repositories for LMPT
Users that are interested in LMPT are comparing it to the libraries listed below
Sorting:
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆44Nov 30, 2024Updated last year
- Code for Chinese grammatical error correction based on knowledge distillation☆11Aug 16, 2022Updated 3 years ago
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models☆96Dec 13, 2024Updated last year
- ☆23Jan 12, 2024Updated 2 years ago
- This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and cha…☆21Aug 26, 2022Updated 3 years ago
- ☆27Jan 25, 2024Updated 2 years ago
- [MedIA'25] FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.☆168Nov 27, 2025Updated 2 months ago
- ☆12May 19, 2023Updated 2 years ago
- VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition☆69Oct 1, 2022Updated 3 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆17Feb 12, 2025Updated last year
- ☆19Dec 19, 2025Updated last month
- ☆20May 3, 2025Updated 9 months ago
- ICLR 2023 and ICML 2023 paper☆23Sep 16, 2024Updated last year
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆50May 26, 2023Updated 2 years ago
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing☆27Jul 4, 2023Updated 2 years ago
- SUPERVAIZER is a toolkit built for the age of AI interoperability. At its core, it implements Google's Agent-to-Agent (A2A) protocol, ena…☆14Feb 4, 2026Updated last week
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024)☆19Jul 15, 2024Updated last year
- Interactive Multi-Label CNN Learning with Partial Labels @ CVPR20☆22Dec 21, 2021Updated 4 years ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆55Aug 19, 2023Updated 2 years ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆60Jul 5, 2025Updated 7 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆20May 7, 2022Updated 3 years ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆59Jun 4, 2023Updated 2 years ago
- Implementing ONNX runtime for android to run Segment Anything Model 2☆12Aug 1, 2025Updated 6 months ago
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos☆28Dec 8, 2023Updated 2 years ago
- ☆30Mar 2, 2023Updated 2 years ago
- [npj Digital Medicine'24] Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis☆33Jan 7, 2025Updated last year
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆35Nov 3, 2024Updated last year
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆183Mar 4, 2024Updated last year
- ☆200May 10, 2023Updated 2 years ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆71Oct 24, 2023Updated 2 years ago
- [CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning☆74Feb 24, 2024Updated last year
- Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".☆130Nov 7, 2024Updated last year
- [ ECCV 2020 Spotlight ] Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets"☆374Jul 14, 2022Updated 3 years ago
- Source code for the paper "Structured Attention Graphs for Understanding Deep Image Classifications"☆31Dec 18, 2021Updated 4 years ago
- A collection of resources on applications of multi-modal learning in medical imaging.☆913Feb 8, 2026Updated last week
- Self hosted AI workflow for scraping Instagram Reels (audio and description). Extracting, summarising and categorising, then storing all …☆27Jan 10, 2026Updated last month
- 情人节告白,将意中人素材图片融合到主图中☆10Aug 7, 2019Updated 6 years ago
- ☆11Mar 11, 2024Updated last year
- This project is an AI Recruitment System designed to accelerate the hiring process for HR and technical recruiters.☆14Jan 3, 2025Updated last year