ArthurZucker / RecvisProjectLinks
In this project, we propose to study Vision Transformers trained using the Barlow Twins self-supervised method, and compare the results with DINO. We demonstrate the effectiveness of the Barlow Twins method by showing that networks pretrained on the small PASCAL VOC 2012 dataset are able to generalize well. Authors: Apavou Clément & Zucker Arthu…
☆16Updated last year
Alternatives and similar repositories for RecvisProject
Users that are interested in RecvisProject are comparing it to the libraries listed below
Sorting:
- Visualizing representations with diffusion based conditional generative model.☆97Updated 2 years ago
- ☆51Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- ☆32Updated last year
- ☆185Updated last year
- [ICCV25] Official Implementation of LeGrad☆78Updated 10 months ago
- Code release for "Improved baselines for vision-language pre-training"☆60Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Updated 4 months ago
- Official code for "TOAST: Transfer Learning via Attention Steering"☆188Updated 2 years ago
- Video descriptions of research papers relating to foundation models and scaling☆31Updated 2 years ago
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆43Updated 11 months ago
- A simple minimal implementation of Reversible Vision Transformers☆125Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆116Updated 4 months ago
- Uncertainty-aware representation learning (URL) benchmark☆105Updated 5 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated last year
- understanding model mistakes with human annotations☆106Updated 2 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆207Updated 2 years ago
- ☆19Updated 2 years ago
- ☆10Updated last year
- [NeurIPS 2024] Code for the paper: B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable.☆33Updated 3 months ago
- Procedural Image Programs for Representation Learning - NeurIPS 2022☆35Updated 11 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆103Updated last year
- Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita…☆11Updated 2 years ago
- ☆56Updated last year
- A repository to house some personal attempts to beat some state-of-the-art for medical datasets☆99Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆67Updated 11 months ago
- An official PyTorch implementation for CLIPPR☆29Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago