ArthurZucker / RecvisProject
In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results with DINO. We demonstrate the effectiveness of the Barlow Twins method by showing that networks pretrained on the small PASCAL VOC 2012 dataset are able to generalize well. Authors: Apavou Clément & Zucker Arthu…
☆15 · Updated last year
Alternatives and similar repositories for RecvisProject:
Users who are interested in RecvisProject are comparing it to the repositories listed below:
- Visualizing representations with diffusion based conditional generative model. ☆89 · Updated last year
- ☆62 · Updated 4 months ago
- ☆50 · Updated last year
- Natural Language Descriptions of Deep Visual Features, ICLR 2022 ☆62 · Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆48 · Updated 8 months ago
- Code for the paper: B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable. NeurIPS 2024. ☆29 · Updated this week
- Official code for "TOAST: Transfer Learning via Attention Steering" ☆188 · Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496 ☆86 · Updated 7 months ago
- ☆182 · Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆150 · Updated last year
- ☆23 · Updated 4 months ago
- ☆49 · Updated 8 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation ☆40 · Updated 4 months ago
- Code release for "Improved baselines for vision-language pre-training" ☆60 · Updated 9 months ago
- ☆47 · Updated last year
- understanding model mistakes with human annotations ☆106 · Updated 2 years ago
- ☆36 · Updated 7 months ago
- ☆64 · Updated last year
- Sparse Linear Concept Embeddings ☆81 · Updated 6 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆99 · Updated last year
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial … ☆37 · Updated last year
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism ☆77 · Updated 2 years ago
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo" ☆30 · Updated last year
- [NeurIPS 23' Oral] Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity ☆26 · Updated 9 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis" ☆51 · Updated 5 months ago
- Official repository of paper "Subobject-level Image Tokenization" ☆65 · Updated 9 months ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning. ☆204 · Updated last year
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers ☆56 · Updated last year
- Patching open-vocabulary models by interpolating weights ☆91 · Updated last year