☆24Sep 24, 2023Updated 2 years ago
Alternatives and similar repositories for VSP
Users that are interested in VSP are comparing it to the libraries listed below
Sorting:
- Learning by Aligning Videos in Time (CVPR 2021)☆14Sep 10, 2023Updated 2 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- Context-Aware Sequence Alignment using 4D Skeletal Augmentation CVPR 2022☆23Jul 18, 2022Updated 3 years ago
- ☆26Nov 7, 2023Updated 2 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆55Sep 7, 2023Updated 2 years ago
- Official PyTorch code for the CVPR 2024 paper 'Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognitio…☆37May 28, 2025Updated 9 months ago
- ☆34Jun 2, 2023Updated 2 years ago
- ☆43Dec 1, 2025Updated 3 months ago
- 統計検定準1級合格に向けて☆20May 31, 2025Updated 9 months ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆41Feb 28, 2024Updated 2 years ago
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- ☆14Sep 11, 2025Updated 5 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- Code for "HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation" CVPR2024☆10Apr 19, 2024Updated last year
- [CVPR 2023] Spatial-then-Temporal Self-Supervised Learning for Video Correspondence☆11Jul 5, 2023Updated 2 years ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 3 years ago
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year
- Spectral Graph Attention Network with Fast Eigen-approximation☆12Dec 24, 2021Updated 4 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025☆25Nov 28, 2025Updated 3 months ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- [ECCV 2024] Official repository of SkateFormer☆125Nov 18, 2024Updated last year
- Code for the TIP 2023 paper "Delving into Crispness: Guided Label Refinement for Crisp Edge Detection".☆10Jan 29, 2024Updated 2 years ago
- Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE☆15Nov 25, 2022Updated 3 years ago
- Implementation of the paper 'Stochastic Wasserstein Barycenters'☆11Oct 17, 2018Updated 7 years ago
- Code to reproduce 'MOCCA: Multi-Layer One-Class Classification for Anomaly Detection'☆10Dec 12, 2021Updated 4 years ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Updated this week
- ☆12Jan 17, 2024Updated 2 years ago
- Fully buildable project files of Little Lead-rical Leader Pack (Yuni, Kokkoro, Kyoka), a leader mod for Sid Meier's Civilization VI.☆12Aug 13, 2023Updated 2 years ago
- [IEEE TMM] The official implementation of MAE3D☆13Oct 19, 2023Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- Repository for the CVPR23 paper Re^2TAL☆13Nov 21, 2025Updated 3 months ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 4 months ago
- Behavioural and Dynamic Learning Network (BunDLe-Net) is an algorithm to learn meaningful coarse-grained representations from time-series…☆14Apr 15, 2025Updated 10 months ago
- ☆12Apr 13, 2023Updated 2 years ago