microsoft/VLM-Video-Action-Localization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/VLM-Video-Action-Localization)

microsoft / VLM-Video-Action-Localization

☆26

Alternatives and similar repositories for VLM-Video-Action-Localization

Users that are interested in VLM-Video-Action-Localization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cogito2012 / OpenMixer
View on GitHub
[WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
☆17Mar 23, 2025Updated last year
webber2933 / iCLIP
View on GitHub
[ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
☆21Feb 22, 2024Updated 2 years ago
mayhugotong / VideoINSTA
View on GitHub
This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…
☆24Apr 7, 2026Updated 3 months ago
lucazanella / lavad
View on GitHub
Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024
☆149Jul 15, 2024Updated 2 years ago
budzianowski / opengvl
View on GitHub
Open GVL
☆23Dec 1, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
qijimrc / ROBUST
View on GitHub
☆13Oct 19, 2023Updated 2 years ago
lizechng / FSG-FCOS3D
View on GitHub
☆10Apr 27, 2022Updated 4 years ago
jylins / hourllava
View on GitHub
[NeurIPS 2025 Spotlight] Unleashing Hour-Scale Video Training for Long Video-Language Understanding
☆19Jun 24, 2025Updated last year
frostinassiky / bsp
View on GitHub
Placeholder for code of BSP.
☆11Aug 13, 2021Updated 4 years ago
thearkaprava / MS-Temba
View on GitHub
[CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'
☆48Jun 22, 2026Updated 3 weeks ago
MI-Hussain / RVMDE
View on GitHub
RVMDE : Radar Validated Monocular Depth Estimation for Robotics
☆15Oct 5, 2021Updated 4 years ago
ShijianDeng / AV-ASD
View on GitHub
[IEEE Transactions on Multimedia] Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
☆16Nov 19, 2024Updated last year
merantix / acosp
View on GitHub
Semantic Segmentation in Pytorch
☆10Dec 9, 2022Updated 3 years ago
UCSC-VLAA / Recap-DataComp-1B
View on GitHub
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
☆152Jun 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NVIDIA-AI-IOT / deepstream_triton_migration
View on GitHub
Triton Migration Guide for DeepStreamSDK.
☆15Dec 19, 2023Updated 2 years ago
yzhan238 / SeedTopicMine
View on GitHub
The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.
☆14May 27, 2023Updated 3 years ago
mpitropov / LiDAR-MIMO
View on GitHub
Efficient Uncertainty Estimation for LiDAR-based 3D Object Detection
☆10Nov 8, 2022Updated 3 years ago
rkjones4 / ShapeMOD
View on GitHub
Public code release for SIGGRAPH 2021 paper: ShapeMOD: Macro Operation Discovery for 3D Shape Programs
☆13Sep 8, 2021Updated 4 years ago
ServiceNow / promptmix-emnlp-2023
View on GitHub
Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023
☆12Dec 13, 2023Updated 2 years ago
wtwong316 / Univariate-Time-Series-Prediction-using-Deep-Learning
View on GitHub
Univariate Time Series Prediction using Deep Learning and PyTorch
☆15Feb 7, 2021Updated 5 years ago
omar-mohamed / Transformer-Arabic-To-English
View on GitHub
Arabic To English translation using transformer neural nets.
☆15Mar 15, 2019Updated 7 years ago
lizechng / Dynamic-Gesture-Recognition-Based-on-FMCW
View on GitHub
☆12Mar 15, 2022Updated 4 years ago
Eternaldeath / ChinasInternetChronicle
View on GitHub
通过时间轴的方式展示中国互联网的变迁
☆16Sep 9, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
appletea233 / LLaVA-ST
View on GitHub
[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
☆84Jul 4, 2025Updated last year
wengzejia1 / Open-VCLIP
View on GitHub
☆119Feb 19, 2024Updated 2 years ago
erictzeng / ssa-segmentation-release
View on GitHub
☆12Sep 29, 2019Updated 6 years ago
fywalter / label-bias
View on GitHub
A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning
☆10Aug 4, 2023Updated 2 years ago
dairui01 / Toyota_Smarthome
View on GitHub
Tools for Toyota Smarthome datasets
☆16Nov 16, 2022Updated 3 years ago
idiap / geomgaze
View on GitHub
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour; code and checkpoints
☆20Feb 13, 2025Updated last year
facebookresearch / stepdiff
View on GitHub
Data release for Step Differences in Instructional Video (CVPR24)
☆15Jun 19, 2024Updated 2 years ago
EvolvingLMMs-Lab / VideoMMMU
View on GitHub
Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
☆72Sep 5, 2025Updated 10 months ago
jeasinema / egl-docker
View on GitHub
A customized docker for headless GPU rendering without host-side configuration
☆11Aug 22, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVIDIA-AI-IOT / deepstream-segmentation-analytics
View on GitHub
A project demonstration to do the industrial defect segmentation based on loading the image from directory and generate the output ground…
☆11Jun 27, 2024Updated 2 years ago
Philip-MIT / rover-vlm
View on GitHub
☆18Dec 1, 2025Updated 7 months ago
kevinschaich / py-imessage-shortcuts
View on GitHub
💬 Send iMessages using Python through the Shortcuts app.
☆18May 25, 2024Updated 2 years ago
zhiming-xu / computer-graphics
View on GitHub
Foundation of computer graphics course assignment at Berkeley in spring 2019
☆15May 25, 2019Updated 7 years ago
benedettaliberatori / T3AL
View on GitHub
Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024
☆75Sep 11, 2024Updated last year
ItIsFriday / PcdSeg
View on GitHub
☆12Nov 28, 2022Updated 3 years ago
mybearyZhang / TwoStageReason
View on GitHub
Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning
☆13Jun 1, 2025Updated last year