kahnchana / clippy
Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
☆37Updated last year
Alternatives and similar repositories for clippy:
Users who are interested in clippy are comparing it to the repositories listed below
- ☆26Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆41Updated 2 years ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆38Updated 4 months ago
- Official PyTorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆49Updated last month
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆37Updated 5 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- Official implementation of TCL (CVPR 2023)☆109Updated last year
- ☆11Updated 7 months ago
- ☆58Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆25Updated last month
- ☆35Updated 10 months ago
- ☆16Updated last month
- PyTorch code and pretrained weights for the UNIC models.☆27Updated 5 months ago
- ☆58Updated last year
- Repo for the paper "Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment" (AAAI'24 Oral)☆25Updated 9 months ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆34Updated 2 years ago
- Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity"☆56Updated last year
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆26Updated 6 months ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated last year
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆49Updated 7 months ago
- ☆30Updated last week
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆62Updated 6 months ago
- ☆20Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆68Updated 6 months ago
- ☆50Updated 2 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆34Updated 2 months ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆41Updated 5 months ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆78Updated 7 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Updated 4 months ago