samar-khanna / ExPLoRA
Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"
☆31 Updated 9 months ago
Alternatives and similar repositories for ExPLoRA
Users who are interested in ExPLoRA are comparing it to the repositories listed below.
- Official Pytorch Implementation of Self-emerging Token Labeling ☆33 Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 Updated 10 months ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching" ☆35 Updated 10 months ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation" ☆26 Updated 2 months ago
- ☆49 Updated last week
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations ☆42 Updated 9 months ago
- ☆34 Updated last year
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 ☆60 Updated 4 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆76 Updated 3 years ago
- A tiny package supporting distributed computation of COCO metrics for PyTorch models. ☆15 Updated 2 years ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation ☆44 Updated 9 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 Updated last year
- ☆62 Updated last year
- ☆96 Updated 3 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆56 Updated last year
- SAM-CLIP module for use with Autodistill. ☆15 Updated last year
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25) ☆77 Updated last week
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 Updated last year
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding" ☆33 Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆65 Updated last month
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation. ☆37 Updated 7 months ago
- ☆50 Updated 6 months ago
- ☆58 Updated last year
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi… ☆85 Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated last year
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆29 Updated last year
- ☆24 Updated 2 years ago
- Command-line tool for extracting DINO, CLIP, and SigLIP2 features for images and videos ☆24 Updated 3 weeks ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. ☆18 Updated 6 months ago