ArthurZucker / RecvisProject
In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results with DINO. We demonstrate the effectiveness of the Barlow Twins method by showing that networks pretrained on the small PASCAL VOC 2012 dataset are able to generalize well. Authors: Apavou Clément & Zucker Arthu…
☆16Updated 2 years ago
Alternatives and similar repositories for RecvisProject
Users who are interested in RecvisProject are comparing it to the libraries listed below.
- Visualizing representations with diffusion based conditional generative model.☆106Updated 2 years ago
- Code release for "Improved baselines for vision-language pre-training"☆62Updated last year
- ☆192Updated 2 years ago
- ☆56Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆55Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering"☆188Updated 2 years ago
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆41Updated 2 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated 2 years ago
- (ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks☆51Updated 3 years ago
- A simple minimal implementation of Reversible Vision Transformers☆127Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆210Updated 2 years ago
- Implementation of LogAvgExp for Pytorch☆37Updated 9 months ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆81Updated 3 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆74Updated last year
- [ICCV25] Official Implementation of LeGrad☆86Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Updated 9 months ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆98Updated 4 years ago
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆130Updated 3 weeks ago
- Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita…☆11Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆104Updated 2 years ago
- Uncertainty-aware representation learning (URL) benchmark☆106Updated 10 months ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆111Updated last year
- ☆32Updated last year
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆133Updated 3 years ago
- A repository to house some personal attempts to beat some state-of-the-art for medical datasets☆101Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago
- FID computation in Jax/Flax.☆29Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Updated last year