alibaba-mmai-research / TAdaConvView external linksLinks
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
☆241Aug 23, 2023Updated 2 years ago
Alternatives and similar repositories for TAdaConv
Users that are interested in TAdaConv are comparing it to the libraries listed below
Sorting:
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆207Aug 18, 2022Updated 3 years ago
- TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022) & TCTrack++ (TPAMI)☆198Aug 29, 2023Updated 2 years ago
- [ICLR2021] official implementation of CT-Net☆37Dec 29, 2021Updated 4 years ago
- ☆49Nov 12, 2022Updated 3 years ago
- Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition☆25Jul 12, 2022Updated 3 years ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Jul 29, 2024Updated last year
- [CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging☆50Sep 30, 2023Updated 2 years ago
- [ICLR2022] official implementation of UniFormer☆896Mar 29, 2024Updated last year
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆51Jul 9, 2022Updated 3 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,181Jul 11, 2024Updated last year
- EssentialMC2 Video Understanding.☆115Oct 26, 2022Updated 3 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆101Oct 30, 2022Updated 3 years ago
- A simple but efficient transformer model for video action recognition☆62Oct 8, 2022Updated 3 years ago
- [TIP 2022] End-to-end Temporal Action Detection with Transformer☆162Feb 19, 2023Updated 2 years ago
- 【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?☆73Jan 26, 2024Updated 2 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆602Dec 6, 2023Updated 2 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- [CVPR 2022] Official Pytorch Implementation for "Spatio-temporal Relation Modeling for Few-shot Action Recognition". SOTA Results for Few…☆101Sep 30, 2022Updated 3 years ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆155Sep 9, 2024Updated last year
- [ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation☆92Apr 5, 2022Updated 3 years ago
- TCM: Temporal Correlation Module☆17Apr 24, 2021Updated 4 years ago
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆14Jun 4, 2021Updated 4 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆94Sep 13, 2024Updated last year
- Official Code for "GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild", U. Michieli, E. Borsato, L. Ros…☆28Nov 30, 2020Updated 5 years ago
- ☆21Oct 22, 2020Updated 5 years ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,196Dec 15, 2025Updated 2 months ago
- ☆193Oct 22, 2022Updated 3 years ago
- ☆42Apr 7, 2024Updated last year
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆233Jun 13, 2022Updated 3 years ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,918Aug 14, 2024Updated last year
- Code release for ActionFormer (ECCV 2022)☆539Apr 11, 2024Updated last year
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆32Dec 7, 2022Updated 3 years ago
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆42Jul 7, 2021Updated 4 years ago
- MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;☆290May 26, 2022Updated 3 years ago
- SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)☆41Mar 10, 2022Updated 3 years ago
- Test-Time Training on Video Streams☆66Jul 24, 2023Updated 2 years ago