haamoon / mmtmLinks
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"
☆120Updated 5 years ago
Alternatives and similar repositories for mmtm
Users that are interested in mmtm are comparing it to the libraries listed below
Sorting:
- Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"☆81Updated 4 years ago
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆65Updated 5 years ago
- [TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"☆312Updated last year
- Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"☆81Updated 5 years ago
- MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)☆171Updated 2 years ago
- ☆69Updated 4 years ago
- Repository to contain the code for the CVPR 2020 publication: Multi-Modal Domain Adaptation for Fine-Grained Action Recognition☆67Updated 5 years ago
- [NeurIPS 2021] Space-time Mixing Attention for Video Transformer☆18Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer - Zipping Coding Challenge☆33Updated 4 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆30Updated 3 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆207Updated 4 years ago
- Video Transformer Network☆41Updated 4 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Updated 4 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆42Updated 4 years ago
- A public Python implementation for generating Dynamic Images introduced in 'Dynamic Image Networks for Action Recognition' by Bilen et a…☆40Updated 5 years ago
- Signal Level Deep Metric Learning for One-Shot Action Recognition☆22Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆87Updated 4 years ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆49Updated last year
- Spatio-Temporal AudioVisual Saliency Network☆53Updated last year
- [CVPR 2022] Official Pytorch Implementation for "Spatio-temporal Relation Modeling for Few-shot Action Recognition". SOTA Results for Few…☆101Updated 3 years ago
- TCM: Temporal Correlation Module☆17Updated 4 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆270Updated 5 years ago
- Semi-Supervised Action Recognition with Temporal Contrastive Learning☆59Updated last year
- TAM: Temporal Adaptive Module for Video Recognition☆206Updated 3 years ago
- Unofficial implementation of the paper 'Deep Co-Training for Semi-Supervised Image Recognition'☆63Updated 6 years ago
- [BMVC 2021] The official PyTorch implementation of Feature Fusion Vision Transformer for Fine-Grained Visual Categorization☆50Updated 3 years ago
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning☆70Updated 5 years ago
- Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…☆50Updated 4 years ago
- ☆244Updated 2 years ago