hila-chefer / RobustViTLinks
[NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows to finetune the explainability maps of Vision Transformers to enhance robustness.
☆133Updated 3 years ago
Alternatives and similar repositories for RobustViT
Users that are interested in RobustViT are comparing it to the libraries listed below
Sorting:
- understanding model mistakes with human annotations☆106Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Updated 3 years ago
- Code release for "Improved baselines for vision-language pre-training"☆61Updated last year
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- ☆190Updated 2 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated 2 years ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆149Updated 2 years ago
- PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures (CVPR 2022)☆109Updated 3 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning☆108Updated 2 years ago
- PyTorch code for MUST☆107Updated 7 months ago
- MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)☆109Updated 3 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆138Updated 2 years ago
- The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob…☆109Updated 5 months ago
- CLIP Object Detection, search object on image using natural language #Zeroshot #Unsupervised #CLIP #ODS☆140Updated 3 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 4 years ago
- ☆87Updated 3 years ago
- Release of ImageNet-Captions☆51Updated 2 years ago
- REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets --- https://arxiv.org/abs/2004.07999☆110Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated 2 years ago
- ☆34Updated 2 years ago
- Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).☆122Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆79Updated 3 years ago
- ☆103Updated last year
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 4 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Updated last year
- ☆120Updated 2 years ago
- An open source implementation of CLIP.☆33Updated 3 years ago
- Code release for "Dropout Reduces Underfitting"☆317Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆23Updated 3 years ago