Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
☆202Mar 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for nested-transformer
Users that are interested in nested-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- ☆249Mar 16, 2022Updated 4 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- ☆17Nov 4, 2022Updated 3 years ago
- Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)☆540Nov 5, 2024Updated last year
- Source code and model weights for the PGGAN model utilised for the paper: Evaluating the Clinical Realism of Synthetic Chest X-Rays Gener…☆12Mar 2, 2021Updated 5 years ago
- ☆246Jul 23, 2021Updated 4 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Jun 13, 2023Updated 2 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,194Oct 27, 2023Updated 2 years ago
- Official DeiT repository☆4,327Mar 15, 2024Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆408Nov 10, 2023Updated 2 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆652Jul 11, 2023Updated 2 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- ☆821Jul 30, 2022Updated 3 years ago
- L-Verse: Bidirectional Generation Between Image and Text☆107Apr 1, 2025Updated 11 months ago
- Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).☆93Jul 30, 2024Updated last year
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,366Jun 1, 2024Updated last year
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- ☆37Oct 26, 2021Updated 4 years ago
- ☆27Jul 1, 2024Updated last year
- GitHub repository for KDD 2021 work: ProtoPShare: Prototypical Parts Sharing for Similarity Discovery in Interpretable Image Classificati…☆14May 30, 2021Updated 4 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Jun 19, 2022Updated 3 years ago
- Contrastive Language-Audio Pretraining☆87Mar 6, 2022Updated 4 years ago
- Codebase for Image Classification Research, written in PyTorch.☆2,166Mar 20, 2024Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago
- An end-to-end PyTorch framework for image and video classification☆1,613Jun 27, 2024Updated last year
- Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral☆239Mar 4, 2023Updated 3 years ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,984Jan 24, 2024Updated 2 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆221Aug 23, 2022Updated 3 years ago
- Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.☆1,538Jul 30, 2024Updated last year
- [ECCV 2022] PadInv: High-fidelity GAN Inversion with Padding Space☆87Dec 17, 2022Updated 3 years ago
- Improving Representation Learning for Histopathologic Images with Cluster Constraints☆17Jan 20, 2024Updated 2 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99May 2, 2022Updated 3 years ago
- Code for the ECCV 2022 paper "Unleashing Transformers"☆185Apr 17, 2023Updated 2 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆119Apr 19, 2022Updated 3 years ago
- Datasets list for various computer vision tasks☆16Sep 7, 2019Updated 6 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆492Apr 28, 2023Updated 2 years ago