rishikksh20/CrossViT-pytorch

rishikksh20 / CrossViT-pytorch

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

☆208

Alternatives and similar repositories for CrossViT-pytorch

Users who are interested in CrossViT-pytorch are comparing it to the repositories listed below.

IBM / CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
☆417 · Jan 12, 2022 · Updated 4 years ago
linhezheng19 / CAT
Official implementation of "CAT: Cross Attention in Vision Transformer".
☆169 · Jun 25, 2022 · Updated 4 years ago
jiankang1991 / IGARSS2020_BWMS
Code for the IGARSS2020 paper: Band-Wise Multi-Scale CNN Architecture for Remote Sensing Image Scene Classification.
☆12 · Nov 18, 2020 · Updated 5 years ago
zhoudaquan / dvit_repo
☆141 · Dec 18, 2021 · Updated 4 years ago
SSYSteve / Human-behaviour-based-depression-analysis-using-hand-crafted-statistics-and-deep-learned
Human behaviour-based automatic depression analysis using hand-crafted statistics and deep learned spectral features
☆12 · Dec 8, 2021 · Updated 4 years ago
kenya-sk / show_attend_and_tell
This repository reimplements the "Show, Attend and Tell" model and adds extra deep learning techniques.
☆12 · Oct 3, 2023 · Updated 2 years ago
jcwang123 / DM2TNet
☆11 · Feb 7, 2023 · Updated 3 years ago
chzh9311 / structural-triangulation
The official implementation of Structural Triangulation
☆33 · Mar 9, 2023 · Updated 3 years ago
HMS97 / GLNET
Convolutional Neural Networks Based Remote Sensing Scene Classification under Clear and Cloudy Environments
☆15 · Feb 27, 2023 · Updated 3 years ago
ReaFly / SemiMedSeg
MICCAI 2021: Self-Supervised Correction Learning for Semi-Supervised Biomedical Image Segmentation (Pytorch implementation).
☆26 · Jan 12, 2023 · Updated 3 years ago
rishikksh20 / CeiT-pytorch
Implementation of Convolutional enhanced image Transformer
☆106 · Mar 27, 2021 · Updated 5 years ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,428 · Jun 22, 2026 · Updated last month
mpapadomanolaki / multi-task-L-UNet
☆42 · May 24, 2021 · Updated 5 years ago
ShubingOuyangcug / GCSANet
Remote sensing scene classification
☆12 · Mar 1, 2023 · Updated 3 years ago
Alibaba-MIIL / ZS_SDL
Official Pytorch implementation of the paper "Semantic Diversity Learning for Zero-Shot Multi-label Classification" (ICCV 2021)
☆31 · Aug 23, 2022 · Updated 3 years ago
luyao777 / HBP-pytorch
Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition, reimplemented in Pytorch.
☆105 · Jul 11, 2020 · Updated 6 years ago
HiLab-git / DCA-Net
☆12 · May 19, 2024 · Updated 2 years ago
Markin-Wang / CLEViT
[IJCAI 2023] CLE-ViT: Contrastive Learning Encoded Transformer for Ultra-Fine-Grained Visual Categorization.
☆10 · Nov 3, 2023 · Updated 2 years ago
AlexeyAB / SPVT-Transformer
☆13 · Nov 7, 2021 · Updated 4 years ago
hathawayxxh / CRCKD
The source code of "Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification" (MICCAI 2021)
☆19 · Sep 17, 2021 · Updated 4 years ago
microsoft / Swin-Transformer
This is an official implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,004 · Jul 24, 2024 · Updated last year
yushuiwx / MH-MoE
☆20 · Nov 5, 2024 · Updated last year
qi-zhe / CLNet
☆13 · May 18, 2021 · Updated 5 years ago
4m4n5 / saasn-stain-normalization
Pytorch implementation of Self Attentive Adversarial Stain Normalization (SAASN).
☆14 · Feb 13, 2023 · Updated 3 years ago
Chenglin-Yang / LESA
Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms
☆20 · Nov 29, 2021 · Updated 4 years ago
ofsoundof / LocalViT
☆118 · Jan 17, 2026 · Updated 6 months ago
grant-jpg / FUSSNet
The code repo for "FUSSNet: Fusing Two Sources of Uncertainty for Semi-Supervised Medical Image Segmentation"
☆14 · Mar 2, 2022 · Updated 4 years ago
microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆609 · May 16, 2023 · Updated 3 years ago
zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18 · Sep 17, 2021 · Updated 4 years ago
lucidrains / global-self-attention-network
A Pytorch implementation of Global Self-Attention Network, a fully attention-based backbone for vision tasks
☆94 · Nov 21, 2020 · Updated 5 years ago
HXLH50K / U-Net-Transformer
☆115 · May 27, 2021 · Updated 5 years ago
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆236 · Feb 3, 2022 · Updated 4 years ago
wangsp1999 / CD-Research
☆11 · May 17, 2023 · Updated 3 years ago
wilile26811249 / CMT_CNN-meet-Vision-Transformer
A PyTorch implementation of CMT, based on the paper "CMT: Convolutional Neural Networks Meet Vision Transformers".
☆72 · Mar 18, 2023 · Updated 3 years ago
changlin31 / BossNAS
(ICCV 2021) BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
☆143 · Dec 6, 2021 · Updated 4 years ago
Terry-Xu-666 / visual_inference_chain
This repository contains the official code for our paper: Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visu…
☆25 · Nov 15, 2024 · Updated last year
ExplainableML / LanguageGuidance_for_DML
This repository contains the code for our CVPR 2022 paper "Integrating Language Guidance into Vision-based Deep Metric Learning".
☆44 · Aug 9, 2022 · Updated 3 years ago
Deferf / CLIP_Video_Representation
Uses CLIP to represent video for retrieval tasks
☆71 · Mar 1, 2021 · Updated 5 years ago