lucidrains / transformer-in-transformerView external linksLinks
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch
β310Dec 27, 2021Updated 4 years ago
Alternatives and similar repositories for transformer-in-transformer
Users that are interested in transformer-in-transformer are comparing it to the libraries listed below
Sorting:
- Implementation of the π Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbonesβ201Mar 24, 2021Updated 4 years ago
- Implementation of E(n)-Transformer, which incorporates attention mechanisms into Welling's E(n)-Equivariant Graph Neural Networkβ226Jun 2, 2024Updated last year
- An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up procβ¦β196Mar 27, 2021Updated 4 years ago
- Official DeiT repositoryβ4,322Mar 15, 2024Updated last year
- [NeurIPSβ2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wangβ1,690Nov 3, 2022Updated 3 years ago
- A simple implementation of a deep linear Pytorch moduleβ21Oct 16, 2020Updated 5 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"β433Sep 5, 2023Updated 2 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNetβ1,191Oct 27, 2023Updated 2 years ago
- β110Sep 15, 2021Updated 4 years ago
- Official implementation of PVT seriesβ1,882Oct 27, 2022Updated 3 years ago
- Implementation of Feedback Transformer in Pytorchβ108Mar 2, 2021Updated 4 years ago
- β20Mar 14, 2021Updated 4 years ago
- [CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformersβ1,109Sep 2, 2024Updated last year
- [CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operatorβ1,317Jul 16, 2021Updated 4 years ago
- Generative Adversarial Transformersβ1,344Jun 14, 2022Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorchβ58Jan 13, 2021Updated 5 years ago
- [CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editingβ782Oct 3, 2023Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ49Jan 27, 2022Updated 4 years ago
- Noah Researchβ934Updated this week
- Implementation of Ο-GAN, for 3d-aware image synthesis, in Pytorchβ124Feb 22, 2021Updated 4 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsificationβ650Jul 11, 2023Updated 2 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorchβ46Mar 3, 2021Updated 4 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Imagesβ58Dec 6, 2021Updated 4 years ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize β¦β1,975Jan 24, 2024Updated 2 years ago
- β280Mar 22, 2021Updated 4 years ago
- End-to-End Object Detection with Fully Convolutional Networkβ496Jan 10, 2022Updated 4 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".β1,023Sep 29, 2022Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)β1,365Jun 1, 2024Updated last year
- Official Pytorch implementation of ReXNet (Rank eXpansion Network) with pretrained modelsβ452Jan 30, 2022Updated 4 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorchβ54Mar 30, 2021Updated 4 years ago
- RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoderβ210Mar 18, 2021Updated 4 years ago
- Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of Alphafold2, as a standalone Pytorcβ¦β171Nov 25, 2022Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformerβ608Feb 14, 2023Updated 3 years ago
- Exploring Self-attention for Image Recognition, CVPR2020.β752Jun 15, 2020Updated 5 years ago
- Unofficial Pytorch Lightning implementation of Contrastive Syn-to-Real Generalization (ICLR, 2021)β17Aug 11, 2021Updated 4 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorchβ59Mar 19, 2021Updated 4 years ago
- Dense reppoints: Representing visual objects with dense point sets https://arxiv.org/abs/1912.11473β145Aug 6, 2020Updated 5 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.β142Dec 21, 2021Updated 4 years ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pyβ¦β24,993Updated this week