thearkaprava / MS-TembaLinks
Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'
☆22Updated this week
Alternatives and similar repositories for MS-Temba
Users that are interested in MS-Temba are comparing it to the libraries listed below
Sorting:
- Improving Mamaba performance on Video Understanding task☆39Updated 8 months ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- Placeholder☆10Updated 2 years ago
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Updated 9 months ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆16Updated 9 months ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆52Updated last year
- Video Feature Enhancement with PyTorch☆31Updated 7 months ago
- LongShortNet for Streaming Perception task.☆13Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆67Updated 4 months ago
- ☆17Updated 8 months ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆31Updated 11 months ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆63Updated 6 months ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆26Updated 9 months ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Updated last year
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆43Updated 7 months ago
- ☆26Updated last year
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆81Updated last month
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆31Updated 9 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆14Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated 9 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆83Updated 6 months ago
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆75Updated 11 months ago
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation☆32Updated last year
- ☆14Updated 3 months ago
- ☆27Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆43Updated 6 months ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year