A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
☆39Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for awesome-video-text-datasets
Users that are interested in awesome-video-text-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆39Apr 17, 2025Updated last year
- Some papers about *diverse* image (a few videos) captioning☆25Apr 4, 2023Updated 3 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 3 years ago
- Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Pe…☆134Oct 27, 2023Updated 2 years ago
- ☆23Mar 1, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code for the ICCV23 paper: "Domain Generalization via Rationale Invariance"☆20Jan 18, 2025Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆15Jun 8, 2026Updated last week
- Automatically erase objects in the video, such as logo, text, etc.☆22Dec 22, 2020Updated 5 years ago
- Implementation of the Gumbel-Sigmoid distribution in PyTorch.☆21Jul 22, 2022Updated 3 years ago
- ☆22Dec 11, 2018Updated 7 years ago
- code for studying OpenAI's CLIP explainability☆39Jan 7, 2022Updated 4 years ago
- Short video crawler based on scrapy☆14May 18, 2026Updated 3 weeks ago
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆29Jul 2, 2025Updated 11 months ago
- This is a Pytorch Implementation of the DASP algorithm from the paper "Explaining Deep Neural Networks with a Polynomial Time Algorithm f…☆11Jun 12, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- GPU accelerated Perlin Noise in python☆11Oct 23, 2020Updated 5 years ago
- ☆14Sep 28, 2023Updated 2 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- Feedforward implementation of Lightweight Probabilistic Deep Networks for Keras and Tensorflow☆14Jul 1, 2019Updated 6 years ago
- Spherical Parameterization of Genus-0 Surfaces☆10Mar 31, 2022Updated 4 years ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- Professor forcing future code☆10Sep 22, 2018Updated 7 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- Customizable video storyboard generator. (Deprecated. Use https://github.com/zmwangx/metadata.)☆29Sep 20, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆130Aug 23, 2024Updated last year
- paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV…☆38Jan 12, 2023Updated 3 years ago
- ☆34Mar 26, 2024Updated 2 years ago
- Simple python rasterizer tool implemented by OpenGL and C++☆15Nov 10, 2025Updated 7 months ago
- ☆14Mar 12, 2023Updated 3 years ago
- Managed L2D tool libs. (In Dev)☆14Apr 20, 2019Updated 7 years ago
- codewithgpu.com python client package☆20May 18, 2023Updated 3 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆56Nov 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- My submission for the Robotic Instrument Segmentation Sub-Challenge held in conjunction with MICCAI 2017.☆14Sep 8, 2017Updated 8 years ago
- AIST++ Dataset Webpage: https://google.github.io/aistplusplus_dataset☆19Apr 8, 2026Updated 2 months ago
- ☆18Nov 11, 2022Updated 3 years ago
- List of PyTorch repositories for visual question answering☆15Jul 4, 2019Updated 6 years ago
- A medical image recognition project powered by self-implemented ResNet and ViT models, utilizing PyTorch, with a user-friendly web demo b…☆14Feb 26, 2024Updated 2 years ago
- PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."☆14Apr 20, 2019Updated 7 years ago
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Mar 21, 2020Updated 6 years ago