samar-khanna / ExPLoRA
Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"
☆24Updated last month
Related projects ⓘ
Alternatives and complementary repositories for ExPLoRA
- Official Pytorch Implementation of Self-emerging Token Labeling☆30Updated 7 months ago
- SAM-CLIP module for use with Autodistill.☆12Updated last year
- ☆30Updated this week
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆52Updated 9 months ago
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆23Updated last week
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Official implementation of ECCV24 paper: POA☆24Updated 3 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆94Updated 5 months ago
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆18Updated 3 months ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Updated last year
- Multimodal Video Understanding Framework (MVU)☆23Updated 6 months ago
- ☆58Updated 8 months ago
- ☆33Updated 10 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆16Updated last week
- ☆12Updated 2 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆16Updated last month
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆30Updated 6 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆18Updated 3 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆22Updated this week
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆33Updated 3 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated 7 months ago
- Load any clip model with a standardized interface☆21Updated 6 months ago
- ☆39Updated last year
- Data-Efficient Multimodal Fusion on a Single GPU☆47Updated 6 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 7 months ago