ShqWW / dwconv2dLinks
This is an efficient cuda implementation of 2D depthwise convolution for large kernel, it can be used in Pytorch deep learning framework.
☆11Updated last year
Alternatives and similar repositories for dwconv2d
Users that are interested in dwconv2d are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆106Updated 11 months ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆110Updated last year
- A curated list of papers on the applications of RWKV in computer vision.☆198Updated last month
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆67Updated 2 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆158Updated 4 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆258Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆79Updated 3 months ago
- ☆41Updated 3 weeks ago
- [ICCV2025] Introduce Mamba2 to Vision.☆143Updated 3 weeks ago
- Code Implementation of EfficientVMamba☆219Updated last year
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆96Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆75Updated 6 months ago
- Official repository of MLLA (NeurIPS 2024)☆335Updated last week
- ☆88Updated 11 months ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated 9 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆28Updated 4 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆114Updated last year
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆26Updated last year
- This repository is the official implementation of "Partial Channel Network: Compute Fewer, Perform Better", which includes training, eval…☆14Updated 5 months ago
- Official repository of Slide-Transformer (CVPR2023)☆172Updated 10 months ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆93Updated last year
- vHeat: Building Vision Models upon Heat Conduction☆232Updated last month
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆82Updated last month
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆343Updated last month
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆42Updated 9 months ago
- [CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"☆80Updated 3 months ago
- ☆77Updated last year
- ☆44Updated 4 months ago
- [CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".☆46Updated last year
- [CVPR 2024] Rewrite the Stars☆403Updated last year