Guaranteer/VidSTG-Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Guaranteer/VidSTG-Dataset)

Guaranteer / VidSTG-Dataset

This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences"

☆70

Alternatives and similar repositories for VidSTG-Dataset

Users that are interested in VidSTG-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tzhhhh123 / HC-STVG
View on GitHub
The HC-STVG Dataset
☆65Apr 12, 2023Updated 3 years ago
zfchenUnique / WSSTG
View on GitHub
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆57Jul 8, 2024Updated 2 years ago
TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
Sy-Zhang / TCMN-Release
View on GitHub
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆16Oct 22, 2022Updated 3 years ago
jy0205 / STCAT
View on GitHub
[NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding
☆54Mar 5, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
antoyang / TubeDETR
View on GitHub
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆194Sep 24, 2023Updated 2 years ago
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
zfchenUnique / VID-Sentence
View on GitHub
This repository provides the dataset introduced by our WSSTG paper
☆13Jul 21, 2019Updated 7 years ago
doc-doc / vRGV
View on GitHub
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
☆57Dec 8, 2022Updated 3 years ago
mayu-ot / hidden-challenges-MR
View on GitHub
codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval
☆20Sep 7, 2020Updated 5 years ago
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
facebookresearch / ActivityNet-Entities
View on GitHub
A Dataset for Grounded Video Description
☆165Jan 4, 2022Updated 4 years ago
escorciav / moments-retrieval-page
View on GitHub
Moments Retrieval Project Webpage (temporal)
☆31Jan 17, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jayleicn / TVRetrieval
View on GitHub
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆163May 28, 2024Updated 2 years ago
ikuinen / CMIN_moment_retrieval
View on GitHub
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87Nov 22, 2020Updated 5 years ago
Soldelli / VLG-Net
View on GitHub
VLG-Net: Video-Language Graph Matching Networks for Video Grounding
☆31May 31, 2022Updated 4 years ago
WuJie1010 / Temporally-language-grounding
View on GitHub
A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"
☆95Sep 21, 2019Updated 6 years ago
26hzhang / VSLNet
View on GitHub
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆113Oct 15, 2021Updated 4 years ago
niluthpol / weak_supervised_video_moment
View on GitHub
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Jul 20, 2020Updated 6 years ago
Alvin-Zeng / DRN
View on GitHub
Dense Regression Network for Video Grounding (CVPR2020)
☆53Jan 28, 2021Updated 5 years ago
jiyanggao / TALL
View on GitHub
TALL: Temporal Activity Localization via Language Query
☆220Mar 15, 2018Updated 8 years ago
xdshang / VidVRD-helper
View on GitHub
To keep updates with VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper
☆102Jan 24, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MichiganCOG / Video-Grounding-from-Text
View on GitHub
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆47Jun 22, 2024Updated 2 years ago
zjr2000 / GVL
View on GitHub
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
☆28Dec 8, 2023Updated 2 years ago
BigRedT / info-ground
View on GitHub
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73Aug 22, 2020Updated 5 years ago
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
JonghwanMun / LGI4temporalgrounding
View on GitHub
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆132Jul 5, 2021Updated 5 years ago
SCZwangxiao / Temporal-Language-Grounding-in-videos
View on GitHub
Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
☆100Jan 23, 2022Updated 4 years ago
ikuinen / semantic_completion_network
View on GitHub
☆26Aug 4, 2020Updated 5 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
yytzsy / SCDM
View on GitHub
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Sep 7, 2021Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
WuJie1010 / TSP-PRL
View on GitHub
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video (AAAI2020)
☆47Jan 22, 2020Updated 6 years ago
liudaizong / CSMGAN
View on GitHub
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34Sep 3, 2020Updated 5 years ago
simon-ging / coot-videotext
View on GitHub
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291Sep 6, 2022Updated 3 years ago
jayleicn / TVQAplus
View on GitHub
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132Oct 25, 2022Updated 3 years ago
WHB139426 / Grounded-Video-LLM
View on GitHub
[EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
☆149Aug 21, 2025Updated 11 months ago
cshizhe / hgr_v2t
View on GitHub
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Jun 12, 2020Updated 6 years ago