haamoon / mmtmLinks
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"
☆116Updated 4 years ago
Alternatives and similar repositories for mmtm
Users that are interested in mmtm are comparing it to the libraries listed below
Sorting:
- Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"☆81Updated 3 years ago
- Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"☆78Updated 5 years ago
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆66Updated 4 years ago
- [TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"☆307Updated 10 months ago
- Repository to contain the code for the CVPR 2020 publication: Multi-Modal Domain Adaptation for Fine-Grained Action Recognition☆62Updated 4 years ago
- MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)☆157Updated 2 years ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆45Updated last year
- ☆66Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆200Updated 4 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆87Updated 3 years ago
- Gluon implementation of channel-attention modules: SE, ECA, GCT☆40Updated 4 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 3 years ago
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 2 years ago
- Video Transformer Network☆41Updated 3 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆264Updated 5 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆29Updated 2 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆156Updated 3 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers.☆70Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆86Updated 3 years ago
- ☆29Updated 6 months ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆37Updated last year
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆32Updated last year
- TAM: Temporal Adaptive Module for Video Recognition☆203Updated 2 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 4 years ago
- PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)☆252Updated 2 years ago
- [BMVC 2021] The official PyTorch implementation of Feature Fusion Vision Transformer for Fine-Grained Visual Categorization☆49Updated 2 years ago
- [CVPR 2022] Official Pytorch Implementation for "Spatio-temporal Relation Modeling for Few-shot Action Recognition". SOTA Results for Few…☆100Updated 2 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 3 years ago
- ☆235Updated last year
- ☆62Updated 3 years ago