UtopAIBuilder / Grad-CAM-for-video-and-regression-task
Exploring the applicability of Grad-CAM for explanation in video based dataset
☆30Updated last year
Alternatives and similar repositories for Grad-CAM-for-video-and-regression-task:
Users that are interested in Grad-CAM-for-video-and-regression-task are comparing it to the libraries listed below
- ☆67Updated 3 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆37Updated last year
- This is a implementation of integrating a simple but efficient attention block in CNN + bidirectional LSTM for video classification.☆23Updated 7 months ago
- ☆27Updated 2 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"☆112Updated 4 years ago
- The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…☆42Updated 3 years ago
- [ICCV 2023] Learning Support and Trivial Prototypes for Interpretable Image Classification☆21Updated 2 months ago
- 3D-ResNeXt101 with Grad-CAM Demo. (Pytorch)☆24Updated 4 years ago
- Video Swin Transformer - PyTorch☆244Updated 3 years ago
- PIP-Net: Patch-based Intuitive Prototypes Network for Interpretable Image Classification (CVPR 2023)☆66Updated last year
- I3D features extractor with resnet50 backbone☆72Updated 2 years ago
- Semi-Supervised Action Recognition with Temporal Contrastive Learning☆56Updated 11 months ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆86Updated 3 years ago
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆23Updated last year
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆66Updated 4 years ago
- Official code repository for SPAct: Self-supervised Privacy Preservation for Action Recognition [CVPR-2022]☆21Updated 2 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆106Updated 8 months ago
- fourierer / Video_Classification_ResNet3D_R2plus1D_ip-CSN_train-UCF101-HMDB51-Kinetics400-from-scratchUsing ResNet3D-50,R(2+1)D-50, and ip_CSN-50 to train UCD-101,HMDB-51 and Kinetics-400 from scratch.☆27Updated 4 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆31Updated last year
- ☆36Updated 2 years ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆56Updated last month
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆41Updated last year
- [IJCARS'22]Trans-SVNet: hybrid embedding aggregation Transformer for surgical workflow analysis, 1st Prize of Best Paper Award of IJCARS-…☆15Updated 2 years ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆59Updated last year
- MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)☆148Updated 2 years ago
- Code of TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation, WACV 2023☆69Updated 2 years ago
- This repo is official implementation of the paper "Multimodal transformer for Nurse Activity Recognition", published in CVPM2022, CVPRW.☆17Updated 9 months ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆41Updated 4 months ago
- ☆12Updated 2 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆99Updated 2 years ago