amirbar / StoPLinks
☆10Updated last year
Alternatives and similar repositories for StoP
Users that are interested in StoP are comparing it to the libraries listed below
Sorting:
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆114Updated 2 months ago
- Distributed Optimization Infra for learning CLIP models☆26Updated 8 months ago
- More dimensions = More fun☆22Updated 11 months ago
- Official PyTorch Implementation☆18Updated 2 years ago
- ☆37Updated 11 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated last year
- CatMAE☆14Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 6 months ago
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆72Updated 2 months ago
- ☆32Updated last month
- ☆50Updated last year
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆28Updated last year
- ☆51Updated 3 months ago
- ☆18Updated 6 months ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆112Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- ☆54Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆40Updated last year
- ☆19Updated 2 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆90Updated 2 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 5 months ago
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆48Updated last year
- CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.☆29Updated last week
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆41Updated 6 months ago
- Visualizing representations with diffusion based conditional generative model.☆95Updated 2 years ago
- ☆29Updated 2 years ago
- ☆65Updated 3 years ago
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023☆169Updated last year
- ☆29Updated 2 years ago