haamoon / mmtm
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"
☆113Updated 4 years ago
Alternatives and similar repositories for mmtm:
Users that are interested in mmtm are comparing it to the libraries listed below
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆66Updated 4 years ago
- Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"☆80Updated 3 years ago
- Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"☆77Updated 4 years ago
- Repository to contain the code for the CVPR 2020 publication: Multi-Modal Domain Adaptation for Fine-Grained Action Recognition☆61Updated 4 years ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆43Updated last year
- Gluon implementation of channel-attention modules: SE, ECA, GCT☆39Updated 4 years ago
- [TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"☆306Updated 8 months ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆39Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆196Updated 3 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Updated 3 years ago
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)☆151Updated 2 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 3 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆199Updated 2 years ago
- This repository contains the source code for the paper "Improving the performance of unimodal dynamic hand gesture recognition with multi…☆29Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆41Updated 3 years ago
- KSSNet: Multi-Label Classification with Label Graph Superimposing☆60Updated 5 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆263Updated 4 years ago
- ☆66Updated 3 years ago
- I3D and 3D-ResNets in PyTorch☆191Updated 6 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 3 years ago
- Signal Level Deep Metric Learning for One-Shot Action Recognition☆22Updated 7 months ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆84Updated 3 years ago
- This is a implementation of integrating a simple but efficient attention block in CNN + bidirectional LSTM for video classification.☆23Updated 7 months ago
- ☆28Updated 2 years ago
- A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers.☆71Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆86Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer - Zipping Coding Challenge☆31Updated 3 years ago
- Video Transformer Network☆40Updated 3 years ago
- 3D-ResNeXt101 with Grad-CAM Demo. (Pytorch)☆24Updated 4 years ago