hengRUC/VSP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hengRUC/VSP)

hengRUC / VSP

☆24

Alternatives and similar repositories for VSP

Users that are interested in VSP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

trquhuytin / LAV-CVPR21
View on GitHub
Learning by Aligning Videos in Time (CVPR 2021)
☆13Sep 10, 2023Updated 2 years ago
patrick-tssn / VSTAR
View on GitHub
[ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information
☆16Oct 27, 2024Updated last year
taeinkwon / CASA
View on GitHub
Context-Aware Sequence Alignment using 4D Skeletal Augmentation CVPR 2022
☆23Jul 18, 2022Updated 3 years ago
iwangjian / Color4Dial
View on GitHub
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue (ACL Findings 2023)
☆21Nov 10, 2025Updated 8 months ago
DAVEISHAN / TCLR
View on GitHub
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
☆40Feb 28, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
huiwon-jang / RSP
View on GitHub
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆28Nov 27, 2024Updated last year
TablemanLiu / Rock-Radar
View on GitHub
☆216Jun 18, 2026Updated 3 weeks ago
kyegomez / Mirasol
View on GitHub
Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"
☆26Jan 27, 2025Updated last year
tum-pbs / LSIM
View on GitHub
LSiM is a learned metric to compute distance values for 2D data from numerical simulations
☆28Oct 4, 2023Updated 2 years ago
HimangiM / RepLAI
View on GitHub
Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…
☆13Oct 23, 2022Updated 3 years ago
HanHuCAS / SurgNet
View on GitHub
The official repository for "SurgNet: Self-supervised Pretraining with Semantic Consistency for Vessel and Instrument Segmentation in Sur…
☆15Dec 30, 2024Updated last year
jinhyunj / EaTR
View on GitHub
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55Sep 7, 2023Updated 2 years ago
musicman217 / Text-Proxy
View on GitHub
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆21May 8, 2026Updated 2 months ago
yunfan1202 / Delving-into-Crispness
View on GitHub
Code for the TIP 2023 paper "Delving into Crispness: Guided Label Refinement for Crisp Edge Detection".
☆11Jan 29, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Arturia-Pendragon-Iris / Frepa
View on GitHub
☆20Jun 19, 2026Updated 3 weeks ago
lshiwjx / DSTA-Net
View on GitHub
☆63Oct 26, 2020Updated 5 years ago
Ace-Pegasus / EasyDrag
View on GitHub
Official code for EasyDrag (CVPR 2024)
☆17Jun 18, 2024Updated 2 years ago
zhiyuanhubj / Long_form_VideoQA
View on GitHub
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
☆18Oct 9, 2024Updated last year
zihuixue / AlignEgoExo
View on GitHub
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆19Apr 5, 2024Updated 2 years ago
md-mohaiminul / TranS4mer
View on GitHub
☆34Jun 2, 2023Updated 3 years ago
lizhilin-ustc / AAAI2023-AICL
View on GitHub
The official implementation of "Actionness Inconsistency-guided Contrastive Learning for Weakly-supervised Temporal Action Localization"(…
☆18Nov 26, 2024Updated last year
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆28Aug 9, 2025Updated 11 months ago
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆18Jun 30, 2026Updated last week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
TIGER-AI-Lab / QuickVideo
View on GitHub
Quick Long Video Understanding [TMLR2025]
☆79Oct 27, 2025Updated 8 months ago
YuanEZhou / CBTrans
View on GitHub
☆24Apr 4, 2022Updated 4 years ago
SwiftieH / SpGAT
View on GitHub
Spectral Graph Attention Network with Fast Eigen-approximation
☆11Dec 24, 2021Updated 4 years ago
UCF-CRCV / TeD-SPAD
View on GitHub
Official code for TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection, accepted at IC…
☆17Feb 18, 2025Updated last year
jayusxp / UECA-Prompt
View on GitHub
UECA-Prompt: Universal Prompt for Emotion Cause Analysis（COLING 2022）
☆16Jun 6, 2023Updated 3 years ago
hananshafi / MedContext
View on GitHub
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
☆14Nov 1, 2024Updated last year
hyungjin-chung / VPS
View on GitHub
☆16Sep 11, 2025Updated 10 months ago
PKU-ICST-MIPL / FineParser_CVPR2024
View on GitHub
☆27Oct 11, 2024Updated last year
TangTao-PKU / ARTS
View on GitHub
[ACM MM 2024] PyTorch Implementation of "ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recov…
☆15Feb 27, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zihuixue / ProgCaptioner
View on GitHub
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26Jul 16, 2025Updated 11 months ago
Mael-zys / InterPose
View on GitHub
(3DV 2026) Pytorch implementation of “InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos”
☆27Mar 16, 2026Updated 3 months ago
OpenGVLab / DiffAgent
View on GitHub
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆19Apr 16, 2024Updated 2 years ago
Pascalson / LERG
View on GitHub
A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…
☆16Mar 21, 2022Updated 4 years ago
sakharok13 / Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception
View on GitHub
☆17Aug 13, 2024Updated last year
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
View on GitHub
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 3 years ago
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago