whwu95 / MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
☆133Updated 3 years ago
Alternatives and similar repositories for MVFNet:
Users that are interested in MVFNet are comparing it to the libraries listed below
- [CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Vid…☆134Updated 4 years ago
- Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.☆220Updated 3 years ago
- [AAAI2021] The source code for our paper 《Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion》.☆96Updated 2 years ago
- Official PyTorch implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21)☆202Updated 3 years ago
- ☆143Updated 2 years ago
- ☆116Updated 4 years ago
- 【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective☆192Updated 10 months ago
- 【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?☆73Updated last year
- Skeleton-based action recognition models in PyTorch, including Two-Stream CNN, HCN, HCN-Baseline, Ta-CNN and Dynamic GCN☆146Updated 2 years ago
- Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers☆121Updated last year
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆42Updated 3 years ago
- [NeurIPS 2021] Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning☆110Updated 3 years ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆147Updated 6 months ago
- Explainable Person Re-Identification with Attribute-guided Metric Distillation☆98Updated 2 years ago
- [ICME 2022] Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation.☆30Updated last year
- The official implementation of BackTAL, TPAMI 2021.☆169Updated 3 years ago
- A general video understanding codebase from SenseTime X-Lab☆445Updated 3 years ago
- [AAAI2020] Cross-Modality Paired-Images Generation for RGB-InfraredPerson Re-Identification☆121Updated 2 years ago
- ☆19Updated 4 years ago
- Official PyTorch Implementation of ProxyGML Loss for Deep Metric Learning, NeurIPS 2020 (spotlight)☆56Updated 3 years ago
- The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)☆165Updated 3 years ago
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation☆122Updated last year
- CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement☆71Updated 3 years ago
- TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation☆93Updated 4 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆200Updated 2 years ago
- [ECCV'22 Oral] Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes☆139Updated last year
- A PyTorch implementation for Convolutional Hierarchical Attention Network for Query-Focused Video Summarization☆59Updated last year
- code for AAAI 2020 paper "ACT"☆90Updated last year
- Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"☆175Updated 3 years ago
- A strong implementation of PCB (Beyond Part Models), outperforming all existing implementations.☆97Updated 4 years ago