yikaiw / CEN
[TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"
☆310 Updated last year
Alternatives and similar repositories for CEN
Users who are interested in CEN are comparing it to the libraries listed below.
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification ☆206 Updated 4 years ago
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral. ☆562 Updated last year
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion" ☆120 Updated 5 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 ☆412 Updated 3 years ago
- Official implementation of "CAT: Cross Attention in Vision Transformer". ☆167 Updated 3 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆397 Updated last year
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆290 Updated 3 years ago
- CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View ☆380 Updated 3 years ago
- CVPR2022, BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning, https://arxiv.org/abs/2203.01522 ☆253 Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆156 Updated 4 years ago
- ☆193 Updated 2 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers. ☆227 Updated 3 years ago
- Unofficial implementation of CondConv: Conditionally Parameterized Convolutions for Efficient Inference in PyTorch. ☆164 Updated last year
- ICCV2021 (Oral) - Exploring Cross-Image Pixel Contrast for Semantic Segmentation ☆687 Updated 3 years ago
- PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf) ☆251 Updated 2 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding ☆217 Updated 5 months ago
- Bottleneck Transformers for Visual Recognition ☆280 Updated 4 years ago
- ☆216 Updated 3 years ago
- Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers ☆199 Updated 4 years ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification ☆489 Updated 2 years ago
- Gated Channel Transformation for Visual Recognition (CVPR 2020) ☆136 Updated 5 years ago
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers ☆228 Updated 4 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆291 Updated 3 years ago
- Two simple and effective designs of vision transformer, which are on par with the Swin transformer ☆606 Updated 2 years ago
- Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition ☆583 Updated 4 years ago
- [ICCV 2021] Code for approximated exponential maximum pooling ☆299 Updated 2 years ago
- [CVPR 2022] Code release for "Multimodal Token Fusion for Vision Transformers" ☆182 Updated 3 years ago
- Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. ☆132 Updated 4 years ago
- A PyTorch Implementation of the following paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision… ☆182 Updated 4 years ago
- iFormer: Inception Transformer ☆247 Updated 2 years ago