UtopAIBuilder / Grad-CAM-for-video-and-regression-taskLinks
Exploring the applicability of Grad-CAM for explanation in video based dataset
☆32Updated last year
Alternatives and similar repositories for Grad-CAM-for-video-and-regression-task
Users that are interested in Grad-CAM-for-video-and-regression-task are comparing it to the libraries listed below
Sorting:
- ☆68Updated 4 years ago
- Video Swin Transformer - PyTorch☆259Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer☆540Updated 4 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"☆117Updated 5 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆299Updated 3 years ago
- CNN LSTM architecture implemented in Pytorch for Video Classification☆291Updated 2 years ago
- MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020)☆160Updated 2 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…☆601Updated 5 months ago
- The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…☆42Updated 4 years ago
- This is a implementation of integrating a simple but efficient attention block in CNN + bidirectional LSTM for video classification.☆24Updated 11 months ago
- [ICLR 2023, Oral] SimPer: Simple Self-Supervised Learning of Periodic Targets☆127Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆86Updated 3 years ago
- ☆238Updated last year
- Normalizing Flows for Human Pose Anomaly Detection [ICCV 2023]☆88Updated last year
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆66Updated 4 years ago
- 3D-ResNeXt101 with Grad-CAM Demo. (Pytorch)☆24Updated 4 years ago
- I3D features extractor with resnet50 backbone☆73Updated 2 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆203Updated 4 years ago
- fourierer / Video_Classification_ResNet3D_R2plus1D_ip-CSN_train-UCF101-HMDB51-Kinetics400-from-scratchUsing ResNet3D-50,R(2+1)D-50, and ip_CSN-50 to train UCD-101,HMDB-51 and Kinetics-400 from scratch.☆28Updated 4 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆32Updated last year
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆45Updated last year
- ☆56Updated 4 years ago
- [ICCV 2023] Learning Support and Trivial Prototypes for Interpretable Image Classification☆23Updated 3 months ago
- Code for ''Alleviating Over-segmentation Errors by Detecting Action Boundaries'' accepted in WACV2021☆59Updated 2 years ago
- PIP-Net: Patch-based Intuitive Prototypes Network for Interpretable Image Classification (CVPR 2023)☆72Updated last year
- ☆31Updated 2 years ago
- Pytorch implementation for t-SNE with cuda to accelerate☆335Updated 2 years ago
- ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021☆102Updated 3 years ago
- Video Transformer Network☆41Updated 4 years ago
- A pytorch implementation of interpretable convolutional neural network.☆67Updated 4 years ago