Improving Mamaba performance on Video Understanding task
☆45Dec 30, 2025Updated 2 months ago
Alternatives and similar repositories for VideoMambaPro
Users that are interested in VideoMambaPro are comparing it to the libraries listed below
Sorting:
- The suite of modeling video with Mamba☆293May 14, 2024Updated last year
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆1,088Jul 6, 2024Updated last year
- ☆82Feb 27, 2025Updated last year
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆11Jun 10, 2025Updated 9 months ago
- [PRCV-2024] State Space Model based Frame-Event Tracking☆48Dec 6, 2025Updated 3 months ago
- Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline☆23Sep 30, 2024Updated last year
- In this repository, a simple implementation of Video augmentation is provided to augment videos for machine learning training tasks.☆20Dec 4, 2024Updated last year
- [ACCV 2024] PyTorch Implementation of the Paper 'VideoPatchCore': Official Version☆31Sep 23, 2025Updated 5 months ago
- [ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification☆16Sep 25, 2023Updated 2 years ago
- ☆26Oct 15, 2024Updated last year
- A dataset with classified film shots☆11Aug 8, 2022Updated 3 years ago
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆17Feb 4, 2026Updated last month
- [ICME 2025 Oral] Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"☆34Mar 23, 2025Updated 11 months ago
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆36May 29, 2024Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆80Dec 25, 2024Updated last year
- ☆25Dec 23, 2024Updated last year
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- [NeurIPS24 Spotlight] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection☆155Sep 26, 2024Updated last year
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆734Feb 18, 2025Updated last year
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 11 months ago
- Vision-Language based Visual Object Tracking☆28Oct 10, 2025Updated 5 months ago
- Official Implementation of Video-MA2MBA☆12Dec 3, 2024Updated last year
- The code of paper "O-Mamba: O-shape State-Space Model for Underwater Image Enhancement"☆13Oct 18, 2024Updated last year
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆137Oct 10, 2024Updated last year
- spatio-temporal tasks☆16Jul 15, 2024Updated last year
- ☆12Jul 26, 2022Updated 3 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆12Jul 26, 2024Updated last year
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.☆11Jan 6, 2023Updated 3 years ago
- The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024☆19Sep 29, 2024Updated last year
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- ☆11Aug 7, 2024Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Jan 28, 2024Updated 2 years ago
- Code for Max-Margin Contrastive Learning - AAAI 2022☆17Apr 25, 2022Updated 3 years ago
- ☆28Apr 8, 2025Updated 11 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- [IJCV-2026, arXiv:2408.09764] Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms☆23Jan 20, 2026Updated 2 months ago
- Code for "Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders" at ICML 2024☆10Sep 18, 2025Updated 6 months ago