jangsoohyuk / SuiT
Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
☆42 Updated 8 months ago
Alternatives and similar repositories for SuiT
Users that are interested in SuiT are comparing it to the libraries listed below
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" ☆49 Updated 3 years ago
- The official implementation of Generalizable Implicit Neural Representations with Instance Pattern Composers (CVPR’23 highlight). ☆41 Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆81 Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆79 Updated 2 years ago
- [CVPR 2024] VkD: Improving Knowledge Distillation using Orthogonal Projections ☆57 Updated last year
- Official code for "Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization (CVPR 2022)" ☆30 Updated 2 years ago
- Frequency Augmented Variational Autoencoder for better Image Reconstruction ☆44 Updated 2 years ago
- Adapters Strike Back (CVPR 2024) ☆38 Updated last year
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?" ☆31 Updated last year
- ☆57 Updated 2 years ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024] ☆102 Updated last year
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆36 Updated last year
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning ☆19 Updated 5 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers ☆21 Updated 3 months ago
- Log-Polar Space Convolution for Convolutional Neural Networks ☆13 Updated 3 years ago
- Unpaired Image-to-Image Translation with Shortest Path Regularization ☆58 Updated 2 years ago
- PyTorch reimplementation of Decoder Denoising Pretraining for Semantic Segmentation ☆51 Updated 2 years ago
- ☆32 Updated last year
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models' ☆27 Updated last year
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning ☆83 Updated 2 years ago
- [ICLR 2025] Official implementation of Autoregressive Pretraining with Mamba in Vision ☆88 Updated 6 months ago
- PyTorch implementation of TDPM ☆36 Updated 2 years ago
- [ICCV 2023] On the Effectiveness of Spectral Discriminators for Perceptual Quality Improvement ☆66 Updated 2 years ago
- ☆45 Updated 2 years ago
- Unofficial implementation of DiffMAE ☆16 Updated last year
- [NeurIPS 2022] Code for the paper "SemMAE: Semantic-guided masking for learning masked autoencoders" ☆41 Updated 2 years ago
- Implementation of Binary Latent Diffusion ☆51 Updated 2 years ago
- Official code for DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation ☆28 Updated 2 years ago
- ☆54 Updated 4 years ago
- Code for the paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf) ☆25 Updated 2 years ago