jangsoohyuk / SuiTLinks
Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
☆42Updated 9 months ago
Alternatives and similar repositories for SuiT
Users that are interested in SuiT are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 3 years ago
- Official code for "Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization (CVPR 2022)"☆30Updated 2 years ago
- The official implementation of Generalizable Implicit Neural Representations with Instance Pattern Composers(CVPR’23 highlight).☆41Updated 2 years ago
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆27Updated last year
- Pytorch implementation of TDPM☆36Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆82Updated 2 years ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆21Updated 4 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆80Updated 2 years ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆103Updated last year
- [NeurIPS2022] FreGAN: Exploiting Frequency Components for Training GANs under Limited Data☆57Updated 3 years ago
- Adapters Strike Back (CVPR 2024)☆40Updated last year
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆28Updated last year
- Few-Shot Diffusion Models☆115Updated 3 years ago
- Pytorch reimplementation of Decoder Denoising Pretraining for Semantic Segmentation☆51Updated 2 years ago
- ☆54Updated 4 years ago
- Implementation of Binary Latent Diffusion☆51Updated 2 years ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆36Updated last year
- Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)☆260Updated last year
- Description: Frequency Augmented Variational Autoencoder for better Image Reconstruction☆44Updated 2 years ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆25Updated 2 years ago
- ☆44Updated 2 months ago
- Official PyTorch Implementation of Peripheral Vision Transformer, NeurIPS 2022☆40Updated 3 years ago
- Code for the paper "Do text-free diffusion models learn discriminative visual representations?"☆33Updated 2 years ago
- ☆57Updated last year
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆44Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆57Updated last year
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆53Updated 3 years ago
- CVPR 2022☆152Updated last year
- Unpaired Image-to-Image Translation with Shortest Path Regularization☆58Updated 2 years ago
- [CVPR 2023] Source code for NoisyTwins: Class-consistent and Diverse Image Generation Through StyleGANs☆36Updated last year