yangzhou321 / VQSALinks
CVPR2023: Vector Quantization with Self-Attention for Quality-Independent Representation Learning.
☆14Updated last year
Alternatives and similar repositories for VQSA
Users that are interested in VQSA are comparing it to the libraries listed below
Sorting:
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆101Updated 3 months ago
- iFormer: Inception Transformer☆247Updated 3 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆167Updated 2 years ago
- Open source implementation of "Vision Transformers Need Registers"☆210Updated last week
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers☆57Updated 2 years ago
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆132Updated 3 weeks ago
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆94Updated last year
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆94Updated 2 years ago
- ImageNet-1K data download, processing for using as a dataset☆125Updated 3 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆63Updated 2 years ago
- Effective Data Augmentation With Diffusion Models☆269Updated last year
- Official repository of Slide-Transformer (CVPR2023)☆175Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆379Updated 3 years ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆234Updated 4 months ago
- [ICCV2025] Introduce Mamba2 to Vision.☆185Updated 3 months ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆310Updated 6 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆540Updated 11 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆131Updated 10 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆81Updated 2 years ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆226Updated last year
- Official Implementation of the ECCV 2024 Paper: "CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts"☆54Updated 3 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆275Updated last year
- (CVPR2024)RMT: Retentive Networks Meet Vision Transformer☆378Updated last year
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Updated 2 years ago
- [ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…☆67Updated 2 years ago
- Masked Autoencoder meets GANs☆30Updated 2 years ago
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆28Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆191Updated 2 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆115Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90Updated 8 months ago