whwu95 / MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
☆133Updated 2 years ago
Alternatives and similar repositories for MVFNet:
Users that are interested in MVFNet are comparing it to the libraries listed below
- [CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Vid…☆133Updated 3 years ago
- ☆143Updated last year
- Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.☆219Updated 2 years ago
- [AAAI2021] The source code for our paper 《Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion》.☆96Updated last year
- 【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective☆191Updated 8 months ago
- ☆116Updated 3 years ago
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆42Updated 3 years ago
- Official PyTorch implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21)☆200Updated 3 years ago
- 【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?☆73Updated last year
- Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers☆121Updated last year
- Skeleton-based action recognition models in PyTorch, including Two-Stream CNN, HCN, HCN-Baseline, Ta-CNN and Dynamic GCN☆145Updated 2 years ago
- The official implementation of BackTAL, TPAMI 2021.☆169Updated 2 years ago
- [NeurIPS 2021] Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning☆110Updated 3 years ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆147Updated 5 months ago
- [ICME 2022] Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation.☆30Updated last year
- Explainable Person Re-Identification with Attribute-guided Metric Distillation☆99Updated 2 years ago
- A general video understanding codebase from SenseTime X-Lab☆444Updated 3 years ago
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation☆122Updated last year
- ☆19Updated 4 years ago
- [AAAI2020] Cross-Modality Paired-Images Generation for RGB-InfraredPerson Re-Identification☆120Updated 2 years ago
- The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)☆164Updated 3 years ago
- ☆27Updated 3 months ago
- Peeking into occluded joints: A novel framework for crowd pose estimation(ECCV2020)☆125Updated 3 years ago
- Amodal-Instance-Segmentation-through-KINS-Dataset☆128Updated 5 years ago
- TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation☆93Updated 4 years ago
- Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".☆90Updated last year
- code for AAAI 2020 paper "ACT"☆90Updated last year
- A PyTorch implementation for Convolutional Hierarchical Attention Network for Query-Focused Video Summarization☆59Updated last year
- A strong implementation of PCB (Beyond Part Models), outperforming all existing implementations.☆97Updated 4 years ago
- A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"☆106Updated 3 years ago