☆30Mar 2, 2023Updated 3 years ago
Alternatives and similar repositories for HiCLIP
Users that are interested in HiCLIP are comparing it to the libraries listed below
Sorting:
- ☆29Jul 25, 2025Updated 7 months ago
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- PyTorch implementation of Semi-Supervised Learning with Scarce Annotations https://arxiv.org/pdf/1905.08845.pdf☆13Jan 6, 2020Updated 6 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- GeckoNum Benchmark for T2I Model Eval.☆15Dec 5, 2024Updated last year
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆35May 29, 2024Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- ☆16Sep 29, 2024Updated last year
- ☆20Apr 23, 2024Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆19Nov 13, 2024Updated last year
- ☆21Dec 30, 2022Updated 3 years ago
- ☆16May 6, 2021Updated 4 years ago
- awesome-semantic-segmentation - list of awesome things around semantic segmentation☆21Apr 28, 2022Updated 3 years ago
- ☆24Oct 9, 2023Updated 2 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- [TPAMI 2023] Generative Multi-Label Zero-Shot Learning☆54Jul 12, 2023Updated 2 years ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆63Jan 4, 2022Updated 4 years ago
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆25Sep 27, 2024Updated last year
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆28Aug 15, 2025Updated 6 months ago
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆55Feb 1, 2026Updated last month
- Solve the berth allocation problem using genetic-algorithm.☆10Jun 8, 2017Updated 8 years ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- ☆29Oct 18, 2022Updated 3 years ago
- The SVO-Probes Dataset for Verb Understanding☆30Jan 28, 2022Updated 4 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 7 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 10 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆76May 27, 2023Updated 2 years ago
- ☆129Jan 20, 2022Updated 4 years ago
- ☆33Nov 4, 2024Updated last year
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.☆34Sep 26, 2024Updated last year
- ☆36May 24, 2024Updated last year
- Margin-based Vision Transformer☆66Nov 28, 2025Updated 3 months ago
- Official implementation of TagAlign☆35Dec 11, 2024Updated last year
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆32Dec 21, 2021Updated 4 years ago
- Repository for the Universal Lesion Segmentation Challenge '23☆40May 11, 2025Updated 9 months ago