yangzhou321 / VQSA
CVPR2023: Vector Quantization with Self-Attention for Quality-Independent Representation Learning.
☆14 Updated last year
Alternatives and similar repositories for VQSA
Users who are interested in VQSA are comparing it to the repositories listed below.
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio… ☆93 Updated last year
- The official project website of "Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kerne… ☆9 Updated 2 years ago
- Official repository of Slide-Transformer (CVPR2023) ☆171 Updated 10 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025] ☆105 Updated 3 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures ☆478 Updated 5 months ago
- Code Implementation of EfficientVMamba ☆216 Updated last year
- Official PyTorch implementation of DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024) ☆118 Updated 4 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆217 Updated last year
- [ICCV2025] Introduce Mamba2 to Vision. ☆141 Updated 3 weeks ago
- (CVPR2024) RMT: Retentive Networks Meet Vision Transformer ☆356 Updated 11 months ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆212 Updated 9 months ago
- A partial implementation of Generative Infinite Vocabulary Transformer (GIVT) from Google Deepmind, in PyTorch. ☆19 Updated last year
- iFormer: Inception Transformer ☆248 Updated 2 years ago
- Open source implementation of "Vision Transformers Need Registers" ☆184 Updated 3 months ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers ☆55 Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆75 Updated 2 years ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆258 Updated last year
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis ☆198 Updated last year
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co… ☆96 Updated 10 months ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio… ☆221 Updated 10 months ago
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition" ☆363 Updated 2 years ago
- List of papers related to State Space Models (Mamba) in Vision. ☆38 Updated last year
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio… ☆290 Updated last year
- Official repository of FLatten Transformer (ICCV2023) ☆438 Updated 8 months ago
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024) ☆326 Updated 4 months ago
- ☆143 Updated last year
- Masked Autoencoder meets GANs ☆27 Updated last year
- Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis ☆240 Updated 5 months ago
- ☆85 Updated last year
- This repository contains the PyTorch code for our IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training… ☆80 Updated 9 months ago