OpenGVLab/video-mamba-suite

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenGVLab/video-mamba-suite)

OpenGVLab / video-mamba-suite

The suite of modeling video with Mamba

☆295

Alternatives and similar repositories for video-mamba-suite

Users that are interested in video-mamba-suite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenGVLab / VideoMamba
View on GitHub
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
☆1,121Jul 6, 2024Updated 2 years ago
sming256 / OpenTAD
View on GitHub
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
☆340Jul 14, 2026Updated last week
hotfinda / VideoMambaPro
View on GitHub
Improving Mamaba performance on Video Understanding task
☆48Dec 30, 2025Updated 6 months ago
happyharrycn / actionformer_release
View on GitHub
Code release for ActionFormer (ECCV 2022)
☆571Apr 11, 2024Updated 2 years ago
CG-Bench / CG-Bench
View on GitHub
☆20Jan 26, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ayiyayi / EgoExoBench
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
sming256 / AdaTAD
View on GitHub
[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆42Jul 9, 2024Updated 2 years ago
OpenGVLab / VideoChat-Flash
View on GitHub
[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
☆527Updated this week
OpenGVLab / InternVideo
View on GitHub
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
☆2,339Jul 2, 2026Updated 3 weeks ago
yingsen1 / UniMD
View on GitHub
UniMD: Towards Unifying Moment retrieval and temporal action Detection
☆57Jul 5, 2024Updated 2 years ago
hustvl / Vim
View on GitHub
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
☆3,889Feb 13, 2025Updated last year
Echo0125 / MAT-Memory-and-Anticipation-Transformer
View on GitHub
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
☆50Oct 7, 2023Updated 2 years ago
Richard-61 / FineAction
View on GitHub
The official codebase of FineAction dataset. We will update the data and code of our FineAction.
☆24Apr 10, 2025Updated last year
NVlabs / LITA
View on GitHub
☆194Oct 14, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wdrink / OmniVid
View on GitHub
☆58Jun 4, 2024Updated 2 years ago
sudo-Boris / mr-Blip
View on GitHub
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆95Mar 9, 2025Updated last year
OpenGVLab / VideoMAEv2
View on GitHub
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆803Oct 8, 2024Updated last year
Finspire13 / DiffAct
View on GitHub
Code for Diffusion Action Segmentation (ICCV 2023)
☆77Aug 16, 2023Updated 2 years ago
MCG-NJU / BasicTAD
View on GitHub
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
☆52Jun 10, 2023Updated 3 years ago
OpenGVLab / EgoExoLearn
View on GitHub
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆85Aug 26, 2025Updated 10 months ago
MzeroMiko / VMamba
View on GitHub
VMamba: Visual State Space Models，code is based on mamba
☆3,207Mar 7, 2025Updated last year
sauradip / action_localization_visualization
View on GitHub
Temporal Action Localization Visualization Tool (TALVT) is a Javascript based simple visualization tool to visualize the outcomes of the …
☆29Sep 29, 2020Updated 5 years ago
facebookresearch / stepdiff
View on GitHub
Data release for Step Differences in Instructional Video (CVPR24)
☆15Jun 19, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
derkbreeze / AwesomeActionSegmentation
View on GitHub
☆33Jun 19, 2026Updated last month
QUVA-Lab / PIN
View on GitHub
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
☆26Jan 14, 2025Updated last year
dingfengshi / TriDet
View on GitHub
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
☆219Dec 27, 2023Updated 2 years ago
HYUNJS / STOV-TAL
View on GitHub
[WACV-2025] Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
☆17May 28, 2025Updated last year
OpenGVLab / perception_test_iccv2023
View on GitHub
Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.
☆14Oct 18, 2023Updated 2 years ago
brown-palm / AntGPT
View on GitHub
Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
☆31Sep 23, 2024Updated last year
OpenGVLab / vinci
View on GitHub
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
☆93Nov 27, 2025Updated 7 months ago
CompVis / zigma
View on GitHub
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
☆351Mar 17, 2025Updated last year
TuanTNG / TemporalMaxer
View on GitHub
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
☆65Dec 6, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lsy0882 / RDFA-S6
View on GitHub
☆16Feb 3, 2025Updated last year
bipashasen / INR-V-VideoGenerationSpace
View on GitHub
The Official Implementation for INR-V: A Continuous Representation Space for Video-based Generative Tasks
☆15Mar 31, 2023Updated 3 years ago
jnypark / VideoMamba
View on GitHub
☆27Jun 4, 2024Updated 2 years ago
OpenGVLab / EgoVideo
View on GitHub
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024
☆136May 11, 2025Updated last year
AmeenAli / HiddenMambaAttn
View on GitHub
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆234Oct 16, 2025Updated 9 months ago
TimeMarker-LLM / TimeMarker
View on GitHub
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
☆107Nov 28, 2024Updated last year
OpenHelix-Team / cobra
View on GitHub
[AAAI-25] Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference
☆296Jan 8, 2025Updated last year