BrianPulfer / vision-retention-networks
Unofficial reimplementation of ViR: Vision Retention Networks by Hatamizadeh et. al. (https://arxiv.org/abs/2310.19731)
☆18Updated 6 months ago
Alternatives and similar repositories for vision-retention-networks:
Users that are interested in vision-retention-networks are comparing it to the libraries listed below
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆62Updated 9 months ago
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated 8 months ago
- [GCPR 2023] UGainS: Uncertainty Guided Anomaly Instance Segmentation☆13Updated 6 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated last year
- Transformers w/o Attention, based fully on MLPs☆93Updated 10 months ago
- Official implementation of MOST: Multiple object localization with self-supervised transformers published at ICCV 2023☆17Updated 11 months ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆28Updated 11 months ago
- ☆65Updated 4 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 6 months ago
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"☆29Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 9 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆100Updated 5 months ago
- More dimensions = More fun☆21Updated 6 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆71Updated 3 months ago
- Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor☆104Updated 8 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆32Updated last month
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?☆25Updated 2 months ago
- Image Instance Segmentation - Zero Shot - OpenAI's CLIP + Meta's SAM☆65Updated last year
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆109Updated 4 months ago
- One summary of efficient segment anything models☆91Updated 6 months ago
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated last year
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 8 months ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆181Updated last year
- Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"☆68Updated last year
- Open-Vocabulary Panoptic Segmentation☆22Updated 5 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆84Updated 5 months ago
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆26Updated 6 months ago
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆56Updated 7 months ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated 8 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆20Updated 4 months ago