A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free-text description
☆29Jan 15, 2022Updated 4 years ago
Alternatives and similar repositories for Video-Timeline-Tags-ViTT
Users that are interested in Video-Timeline-Tags-ViTT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- ☆23Dec 5, 2023Updated 2 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA☆24Oct 15, 2020Updated 5 years ago
- ☆80Nov 24, 2024Updated last year
- ☆16Dec 28, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 5 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 3 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- A shortway to handle keyboard shortcuts.☆14Feb 10, 2019Updated 7 years ago
- command line note organizer☆10Aug 23, 2018Updated 7 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- This repository aims to collect the articles and codes for the Visual Storytelling (VIST) task. VIST is a vision-and-language task. It ai…☆25Mar 3, 2021Updated 5 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Mar 30, 2023Updated 3 years ago
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆208Nov 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.☆24Nov 29, 2024Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- ☆44Mar 8, 2021Updated 5 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Mar 15, 2023Updated 3 years ago
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33May 12, 2022Updated 3 years ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."☆32Jul 13, 2024Updated last year
- ☆18Jul 10, 2024Updated last year
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆41May 3, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Infrastructure useful to create natural language processing systems based on transformer networks☆12Sep 26, 2019Updated 6 years ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Code for "Neural Retrievers are Biased Towards LLM-Generated Content"☆14Oct 18, 2024Updated last year
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆86Jul 1, 2024Updated last year
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago
- ☆62May 11, 2021Updated 4 years ago
- Topic Extraction baseline for Dialogue Text Analysis Task of nlpcc 2022☆15Apr 15, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12May 26, 2023Updated 2 years ago
- Official PyTorch Implementation of RITC☆21Oct 26, 2021Updated 4 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Sep 5, 2023Updated 2 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- ☆28Jul 18, 2025Updated 8 months ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago