☆22Jun 6, 2025Updated 11 months ago
Alternatives and similar repositories for VLM-Video-Action-Localization
Users that are interested in VLM-Video-Action-Localization are comparing it to the libraries listed below.
- A task sequencer framework for achieving a GPT-to-action system in robotics.☆17Mar 6, 2025Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- ☆13Mar 24, 2023Updated 3 years ago
- Project Moab software stack☆24May 23, 2023Updated 2 years ago
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆151Jun 13, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation☆51Mar 23, 2026Updated last month
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024☆141Jul 15, 2024Updated last year
- ☆12Sep 29, 2019Updated 6 years ago
- Tools for Toyota Smarthome datasets☆14Nov 16, 2022Updated 3 years ago
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 6 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆17Mar 18, 2026Updated 2 months ago
- ☆11Jul 4, 2024Updated last year
- A Unified Framework for Video-Language Understanding☆62Jun 17, 2023Updated 2 years ago
- This repo takes the initial step towards leveraging text learning for online action detection without explicit human supervision.☆14Dec 13, 2024Updated last year
- ☆16Apr 14, 2026Updated last month
- This repository provides scripts that can be used to visualize BVH files. These scripts were developed for the GENEA Challenge 2020, and …☆40Feb 23, 2023Updated 3 years ago
- ☆16Apr 11, 2026Updated last month
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning☆11Mar 9, 2023Updated 3 years ago
- ☆12Dec 6, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 10 months ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"☆14May 17, 2022Updated 4 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆36Jul 3, 2025Updated 10 months ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning☆30Apr 10, 2026Updated last month
- project website for "depth sensing beyond LiDAR range"☆11Jul 28, 2020Updated 5 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 3 years ago
- ☆22Apr 17, 2026Updated last month
- The paper "Learning-Semantic-Associations-for-Mirror-Detection" was accepted at CVPR 2022☆14Feb 21, 2024Updated 2 years ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆57Jul 5, 2024Updated last year
- ☆15Mar 15, 2023Updated 3 years ago
- Anomaly detection for images☆13Jan 14, 2020Updated 6 years ago
- This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 8 months ago
- [ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation☆92Apr 5, 2022Updated 4 years ago
- [ICCV-2023] Heterogeneous Forgetting Compensation for Class-Incremental Learning☆12Dec 4, 2023Updated 2 years ago
- A PyTorch collection of semantic segmentation tools.☆32Mar 28, 2019Updated 7 years ago