A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free-text description
☆29Jan 15, 2022Updated 4 years ago
Alternatives and similar repositories for Video-Timeline-Tags-ViTT
Users that are interested in Video-Timeline-Tags-ViTT are comparing it to the libraries listed below
Sorting:
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- ☆80Nov 24, 2024Updated last year
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- ☆16Dec 28, 2020Updated 5 years ago
- ☆17Dec 25, 2023Updated 2 years ago
- Official code of "Discover the Unknown Biased Attribute of an Image Classifier" (ICCV 2021)☆21Oct 11, 2021Updated 4 years ago
- ☆44Mar 8, 2021Updated 4 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.☆23Nov 29, 2024Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Mar 15, 2023Updated 2 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Mar 30, 2023Updated 2 years ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."☆32Jul 13, 2024Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- ☆27Jul 18, 2025Updated 7 months ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- ☆62May 11, 2021Updated 4 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆196Oct 31, 2020Updated 5 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆75Dec 28, 2021Updated 4 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Apr 25, 2021Updated 4 years ago
- Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)☆37May 16, 2022Updated 3 years ago
- Repository of proposal-free temporal moment localization work☆33Jun 11, 2024Updated last year
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33May 12, 2022Updated 3 years ago
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Jan 21, 2023Updated 3 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆30Apr 16, 2021Updated 4 years ago
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆27Mar 19, 2020Updated 5 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆83Jul 1, 2024Updated last year
- ☆36Jul 9, 2025Updated 7 months ago
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation☆32Oct 19, 2023Updated 2 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆229Jan 3, 2024Updated 2 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Dec 19, 2017Updated 8 years ago
- ☆17Apr 25, 2023Updated 2 years ago