IBM/RegionViT

Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689

☆54

Alternatives and similar repositories for RegionViT

Users who are interested in RegionViT are comparing it to the repositories listed below.

wilkinghoff / sub-cluster-AdaCos
Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.
☆11 · Jun 7, 2022 · Updated 4 years ago
CBCZJL / Building-Vision-Transformers-with-Hierarchy-Aware-Feature-Aggregation
ICCV23 Building Vision Transformers with Hierarchy Aware Feature Aggregation
☆22 · Jul 15, 2025 · Updated last year
yuexy / PS-ViT
Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.
☆153 · Jan 14, 2022 · Updated 4 years ago
Atten4Vis / DemystifyLocalViT
Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight
☆185 · Nov 17, 2022 · Updated 3 years ago
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆80 · Aug 30, 2022 · Updated 3 years ago
ChenhongyiYang / GPViT
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
☆102 · May 26, 2023 · Updated 3 years ago
lim142857 / Sparsifiner
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
☆15 · Jul 4, 2023 · Updated 3 years ago
rayleizhu / GLMix
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
☆43 · Jan 21, 2025 · Updated last year
Meituan-AutoML / Twins
Two simple and effective vision transformer designs that are on par with the Swin Transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
aimagelab / MaPeT
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16 · Jul 1, 2025 · Updated last year
rayleizhu / BiFormer
[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"
☆581 · May 22, 2023 · Updated 3 years ago
AlexeyAB / SPVT-Transformer
☆13 · Nov 7, 2021 · Updated 4 years ago
raoyongming / AMixer
[ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers
☆29 · Nov 14, 2022 · Updated 3 years ago
Euphoria16 / TL-Align
[ICCV 2023] The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.
☆23 · Jul 16, 2023 · Updated 3 years ago
cheerss / CrossFormer
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
☆403 · Jan 14, 2024 · Updated 2 years ago
EndoluminalSurgicalVision-IMR / CCFV
[MICCAI 2023] Official implementation of our MICCAI 2023 paper "Pick the Best Pre-trained Model: Towards Transferability Estimation for M…
☆13 · Jul 27, 2023 · Updated 2 years ago
wilkinghoff / dcase2022
Submission for task 2 "Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques"…
☆16 · Sep 19, 2022 · Updated 3 years ago
MarvinYu1995 / HyCTAS
HyCTAS
☆12 · Sep 30, 2025 · Updated 9 months ago
AntXinyuan / SSP
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
☆13 · Jul 7, 2026 · Updated 2 weeks ago
Muzammal-Naseer / IPViT
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)
☆183 · Aug 9, 2022 · Updated 3 years ago
ucasligang / SimViT
[ICME 2022] Code for the paper "SimViT: Exploring a simple vision transformer with sliding windows".
☆67 · Oct 11, 2022 · Updated 3 years ago
VITA-Group / AugMax
[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Z…
☆125 · Dec 29, 2021 · Updated 4 years ago
raven38 / OSSGAN
Official implementation of OSSGAN [CVPR 2022]
☆21 · May 2, 2022 · Updated 4 years ago
facebookresearch / augmentation-corruption
This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".
☆47 · Nov 6, 2022 · Updated 3 years ago
LKJacky / Differentiable-Model-Scaling
This is the official repo for "Differentiable Model Scaling using Differentiable Topk"
☆12 · May 16, 2024 · Updated 2 years ago
ChengyueGongR / PatchVisionTransformer
☆74 · Dec 8, 2022 · Updated 3 years ago
cschaefer26 / StyleMelGAN
☆10 · Apr 8, 2024 · Updated 2 years ago
virajprabhu / PACMAC
PyTorch code for Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency (NeurIPS 2022)
☆20 · Oct 10, 2022 · Updated 3 years ago
patrickvonplaten / audio-gen-dreambooth
☆23 · Jun 13, 2023 · Updated 3 years ago
hhb072 / STViT
☆152 · Jun 25, 2024 · Updated 2 years ago
OliverRensu / DeepMIM
[WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling
☆56 · May 10, 2025 · Updated last year
SHI-Labs / Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
☆1,182 · May 15, 2024 · Updated 2 years ago
shkarupa-alex / tfreplknet
Keras (TensorFlow v2) reimplementation of Re-parameterized Large Kernel Network (RepLKNet)
☆17 · Dec 8, 2022 · Updated 3 years ago
Beckschen / TransMix
[CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.
☆158 · Dec 6, 2022 · Updated 3 years ago
andyrull / width-and-Depth-pruning-for-Vision-Transformer
☆20 · Apr 24, 2022 · Updated 4 years ago
Shanghua-Gao / RBN
The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration
☆85 · Sep 17, 2022 · Updated 3 years ago
wangck20 / GlobalMamba
☆27 · Oct 15, 2024 · Updated last year
BA-Transform / BAT-Image-Classification
This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".
☆13 · Jan 30, 2021 · Updated 5 years ago
shendu0321 / IncepFormer
IncepFormer Official repo
☆33 · Mar 8, 2023 · Updated 3 years ago