rayleizhu / GLMix
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
☆37Updated 2 months ago
Alternatives and similar repositories for GLMix:
Users that are interested in GLMix are comparing it to the libraries listed below
- ☆25Updated 3 weeks ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆75Updated last week
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆27Updated last month
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆89Updated 3 weeks ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆38Updated 2 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆45Updated 2 weeks ago
- ☆43Updated 3 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆83Updated 2 weeks ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆92Updated 10 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆79Updated 2 weeks ago
- ☆64Updated last month
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆13Updated last month
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆119Updated this week
- Official repository of InLine attention (NeurIPS 2024)☆45Updated 3 months ago
- RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations☆14Updated 3 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆52Updated 9 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆49Updated last month
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆20Updated 6 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆48Updated 8 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆29Updated 9 months ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆37Updated 6 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆33Updated 10 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆18Updated 5 months ago
- Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model☆17Updated 9 months ago
- ☆26Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆11Updated 6 months ago
- Adapters Strike Back (CVPR 2024)☆35Updated 8 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 4 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆70Updated 9 months ago