JamesQFreeman / LoRA-ViT
Low rank adaptation for Vision Transformer
☆387 Updated 11 months ago
Alternatives and similar repositories for LoRA-ViT:
Users who are interested in LoRA-ViT are comparing it to the repositories listed below.
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains. ☆397 Updated 4 months ago
- ❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119 ☆1,083 Updated last year
- Low rank adaptation for the Segment Anything Model (SAM) ☆213 Updated 9 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆163 Updated 3 weeks ago
- Official Open Source code for "Scaling Language-Image Pre-training via Masking" ☆413 Updated last year
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897) ☆318 Updated 4 months ago
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition" ☆338 Updated 2 years ago
- ☆581 Updated last year
- ☆498 Updated 3 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio… ☆217 Updated 5 months ago
- ☆501 Updated 2 years ago
- This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch." ☆224 Updated last year
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆491 Updated last year
- CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks ☆388 Updated last week
- [CVPR 2023] CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation ☆181 Updated 5 months ago
- When do we not need larger vision models? ☆368 Updated last week
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models" ☆267 Updated last month
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want ☆777 Updated 6 months ago
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning. ☆225 Updated last year
- ☆249 Updated 2 years ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis ☆551 Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024) ☆443 Updated 8 months ago
- A method to increase the speed and lower the memory footprint of existing vision transformers. ☆1,012 Updated 8 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference ☆145 Updated 4 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆175 Updated last year
- Open-vocabulary Semantic Segmentation ☆329 Updated 4 months ago
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning". ☆706 Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision. ☆181 Updated last year
- Reading list for research topics in Masked Image Modeling ☆331 Updated 2 months ago
- PyTorch implementation of Masked Autoencoder ☆243 Updated last year