all-things-vits / code-samples

Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.

☆195

Alternatives and similar repositories for code-samples

Users who are interested in code-samples compare it to the repositories listed below.

WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆132 Updated 6 months ago
hsouri / Battle-of-the-Backbones
☆209 Updated last year
kyegomez / Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
☆194 Updated last week
brandontrabucco / da-fusion
Effective Data Augmentation With Diffusion Models
☆263 Updated last year
khawar-islam / diffuseMix
Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
☆125 Updated 7 months ago
TonyLianLong / CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆121 Updated 6 months ago
ByungKwanLee / Full-Segment-Anything
This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the fu…
☆162 Updated last year
wysoczanska / clip_dinoiser
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
☆261 Updated last year
naver-ai / cl-vs-mim
(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"
☆111 Updated last year
VinAIResearch / Dataset-Diffusion
Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation (NeurIPS2023)
☆126 Updated last year
ViTAE-Transformer / QFormer
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
☆219 Updated last month
xmed-lab / CLIP_Surgery
[Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
☆445 Updated 7 months ago
u2seg / U2Seg
[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"
☆221 Updated last year
wangf3014 / SCLIP
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
☆172 Updated last year
lisadunlap / ALIA
Augmenting with Language-guided Image Augmentation (ALIA)
☆81 Updated last year
DmitryRyumin / WACV-2024-Papers
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…
☆98 Updated last year
chenhaoxing / DiffusionInst
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
☆244 Updated 9 months ago
LeapLabTHU / EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…
☆225 Updated last year
Atten4Vis / CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
☆118 Updated last year
wgcban / adamae
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆83 Updated last year
facebookresearch / r-mae
PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411
☆113 Updated 2 years ago
yossigandelsman / clip_text_span
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
☆232 Updated 4 months ago
google-research / syn-rep-learn
Learning from synthetic data - code and models
☆323 Updated last year
alinlab / ifseg
IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)
☆94 Updated 2 years ago
xmed-lab / CLIPN
ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
☆140 Updated last year
gkakogeorgiou / attmask
[ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling
☆74 Updated last year
LAION-AI / scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
☆178 Updated 4 months ago
Haochen-Wang409 / HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining
☆103 Updated 6 months ago
Haochen-Wang409 / DropPos
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
☆62 Updated last year
UCSC-VLAA / DMAE
[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
☆108 Updated 2 years ago