dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,381Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Visual-Transformer
- Recent Transformer-based CV and related works.☆1,320Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners☆2,598Updated last year
- Official DeiT repository☆4,053Updated 7 months ago
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark☆3,197Updated last year
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆1,934Updated 2 years ago
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.☆4,048Updated 3 months ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,228Updated 5 months ago
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722☆4,783Updated last month
- Code release for ConvNeXt model☆5,760Updated last year
- ICCV 2023 论文和开源项目合集☆2,502Updated last year
- Official implementation of PVT series☆1,722Updated 2 years ago
- Summary of related papers on visual attention. Related code will be released based on Jittor gradually.☆2,774Updated 2 weeks ago
- A paper list of some recent Transformer-based CV works.☆1,114Updated this week
- Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.☆1,179Updated 3 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,149Updated last year
- OpenMMLab Pre-training Toolbox and Benchmark☆3,442Updated last week
- RepVGG: Making VGG-style ConvNets Great Again☆3,327Updated last year
- ☆2,542Updated 2 years ago
- This is a collection of our NAS and Vision Transformer work.☆1,679Updated 3 months ago
- Implementation of the Swin Transformer in PyTorch.☆794Updated 3 years ago
- Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"☆2,060Updated last year
- [ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods☆2,190Updated last year
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,017Updated 3 weeks ago
- label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful☆2,176Updated 3 weeks ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and …☆1,806Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆13,868Updated 3 months ago
- The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semant…☆3,156Updated last year
- ECCV 2024 论文和开源项目合集,同时欢迎各位大佬提交issue,分享ECCV 2024论文和开源项目☆1,950Updated 3 months ago
- ResNeSt: Split-Attention Networks☆3,236Updated last year
- ☆10,399Updated 5 months ago