YBZh / LAPT
ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
☆18Updated last year
Alternatives and similar repositories for LAPT
Users that are interested in LAPT are comparing it to the libraries listed below
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Updated 2 years ago
- Code release for "MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos" (CVPR2023)☆14Updated 2 years ago
- Code release for "BoxVIS: Video Instance Segmentation with Box Annotation"☆12Updated 2 years ago
- ☆12Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated last year
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆52Updated 5 months ago
- ☆12Updated 5 months ago
- [NeurIPS'25] InstructRestore: Region-Customized Image Restoration with Human Instructions☆45Updated 2 months ago
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆34Updated last year
- Cluster Document for IIL@HIT☆20Updated 2 years ago
- Initial code for computer vision experiments☆11Updated 2 years ago
- Adapters Strike Back (CVPR 2024)☆39Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆95Updated 9 months ago
- An Empirical Study of GPT-4o Image Generation Capabilities☆29Updated 8 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Updated last year
- ☆32Updated last year
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆59Updated 4 months ago
- [ICCV 2023] The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Updated 2 years ago
- The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"☆47Updated 3 months ago
- [NeurIPS2024]☆34Updated last year
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆66Updated 2 years ago
- ☆31Updated 2 years ago
- [AAAI 2023] Symmetry-Aware Transformer-based Mirror Detection☆31Updated 3 years ago
- Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis (CVPR 2023)☆18Updated last year
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Updated last year
- We're Not Using Videos Effectively (TMLR 2024)☆17Updated last year
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024☆40Updated last year