fundamentalvision / Siamese-Image-ModelingLinks
☆16Updated 2 years ago
Alternatives and similar repositories for Siamese-Image-Modeling
Users that are interested in Siamese-Image-Modeling are comparing it to the libraries listed below
Sorting:
- This is a offical PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 3 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 2 months ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 3 years ago
- i-mae Pytorch Repo☆20Updated last year
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 2 years ago
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Updated 3 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Updated 3 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 3 years ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- ☆31Updated 3 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated last year
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 3 years ago
- Official codes for ConMIM (ICLR 2023)☆60Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆56Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆16Updated 3 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 4 years ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆52Updated 3 years ago
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆18Updated 3 years ago
- ☆57Updated 3 years ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 4 months ago