facebookresearch / ViP-MAELinks
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆36Updated 2 years ago
Alternatives and similar repositories for ViP-MAE
Users that are interested in ViP-MAE are comparing it to the libraries listed below
Sorting:
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 10 months ago
- An official PyTorch implementation for CLIPPR☆29Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- Code for T-MARS data filtering☆35Updated last year
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆28Updated last year
- ☆38Updated last year
- This is a offical PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 2 months ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆13Updated 7 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆31Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆32Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 7 months ago
- ☆51Updated last year
- Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]☆16Updated 2 years ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- Un-*** 50 billions multimodality dataset☆23Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- ☆34Updated last year
- ☆11Updated last year
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆100Updated 4 months ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆55Updated 11 months ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆21Updated 8 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last month
- Benchmarking Attention Mechanism in Vision Transformers.☆18Updated 2 years ago
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆33Updated 10 months ago
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆16Updated last year
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated last year