AngelosNal / Vision-DiffMaskLinks
Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.
☆29Updated last year
Alternatives and similar repositories for Vision-DiffMask
Users that are interested in Vision-DiffMask are comparing it to the libraries listed below
Sorting:
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Updated last year
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆98Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆207Updated 2 years ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated last year
- ☆120Updated 2 years ago
- [TMLR 2022] High-Modality Multimodal Transformer☆117Updated 10 months ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Updated last year
- Code release for "Improved baselines for vision-language pre-training"☆60Updated last year
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo…☆41Updated 2 years ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated 9 months ago
- Generate text captions for images from their embeddings.☆115Updated 2 years ago
- Uncertainty-aware representation learning (URL) benchmark☆105Updated 5 months ago
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy☆70Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023☆176Updated 2 years ago
- ☆186Updated last year
- [ICCV25] Official Implementation of LeGrad☆78Updated 10 months ago
- Language Quantized AutoEncoders☆109Updated 2 years ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 8 months ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆173Updated 3 years ago
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆120Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆21Updated last year
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers☆94Updated 2 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆97Updated last year
- ☆23Updated 8 months ago