willyfh/awesome-video-text-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/willyfh/awesome-video-text-datasets)

willyfh / awesome-video-text-datasets

A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.

☆40

Alternatives and similar repositories for awesome-video-text-datasets

Users that are interested in awesome-video-text-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

willyfh / graph-transformer
View on GitHub
An unofficial implementation of Graph Transformer (Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classificat…
☆36Apr 20, 2024Updated 2 years ago
aws-samples / mammography-classification-workshop
View on GitHub
☆12Jan 10, 2023Updated 3 years ago
RyanLiut / awesome-diverse-captioning
View on GitHub
Some papers about *diverse* image (a few videos) captioning
☆25Apr 4, 2023Updated 3 years ago
Zhuo-Cao / FlashVTG
View on GitHub
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)
☆39Apr 17, 2025Updated last year
nhtlongcs / AIC2022-VER
View on GitHub
Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding
☆13Aug 2, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jssprz / video_captioning_datasets
View on GitHub
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Pe…
☆134Oct 27, 2023Updated 2 years ago
liangchen527 / RIDG
View on GitHub
Official code for the ICCV23 paper: "Domain Generalization via Rationale Invariance"
☆20Jan 18, 2025Updated last year
duanjiaqi / Video-Auto-Wipe
View on GitHub
Automatically erase objects in the video, such as logo, text, etc.
☆22Dec 22, 2020Updated 5 years ago
Jinyu2019 / Suppl-data-BBpaper
View on GitHub
The Supplementary data in the paper "A Survey and Systematic Assessment of Computational Methods for Drug Response Prediction"
☆12Sep 27, 2019Updated 6 years ago
AngelosNal / PyTorch-Gumbel-Sigmoid
View on GitHub
Implementation of the Gumbel-Sigmoid distribution in PyTorch.
☆20Jul 22, 2022Updated 4 years ago
duchesneaumathieu / pyperlin
View on GitHub
GPU accelerated Perlin Noise in python
☆11Oct 23, 2020Updated 5 years ago
sMamooler / CLIP_Explainability
View on GitHub
code for studying OpenAI's CLIP explainability
☆39Jan 7, 2022Updated 4 years ago
vkverma01 / Zero-Shot-Learning
View on GitHub
Zero-Shot Learning
☆19Dec 9, 2019Updated 6 years ago
qianlima-lab / TS-TFC
View on GitHub
This is an official pytorch implementation for paper "Temporal-Frequency Co-training for Time Series Semi-supervised Learning" (AAAI-23)…
☆15May 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yeliudev / nncore
View on GitHub
📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.
☆29Jul 9, 2026Updated 2 weeks ago
CuiRuikai / NumGrad-Pull
View on GitHub
☆12Jan 16, 2025Updated last year
DCDmllm / Momentor
View on GitHub
☆81Nov 24, 2024Updated last year
GX77 / LCVSL
View on GitHub
☆14Sep 28, 2023Updated 2 years ago
liuxiaolei88 / Awesome-Text2Video-Retrieval
View on GitHub
The top conferences on video retrieval libraries in recent years, synchronized with my blog.
☆14Nov 27, 2021Updated 4 years ago
leotam / MIMIC-CXR-annotations
View on GitHub
☆15Aug 4, 2020Updated 5 years ago
QiQAng / UEDVC
View on GitHub
☆12May 26, 2023Updated 3 years ago
neopenx / Facial-Expression
View on GitHub
Facial-Expression Recognition with Deep Neural Networks
☆10Mar 6, 2016Updated 10 years ago
fL0n9 / SKFAC-MindSpore
View on GitHub
SKFAC Preconditioner for MindSpore
☆12Jul 2, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yudhanjaya / Eluwa
View on GitHub
A conversational LoRA for OPT 2.7b
☆10Apr 28, 2023Updated 3 years ago
fletcherjiang / LLMEPET
View on GitHub
[MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
☆130Aug 23, 2024Updated last year
susumuota / nano-askllm
View on GitHub
Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.
☆12Jun 19, 2024Updated 2 years ago
NoahBishop / index-tts
View on GitHub
☆12Dec 1, 2025Updated 7 months ago
kkang831 / BDInvert_Release
View on GitHub
☆34Mar 26, 2024Updated 2 years ago
NiaBie / FreeLive
View on GitHub
Managed L2D tool libs. (In Dev)
☆14Apr 20, 2019Updated 7 years ago
minha12 / StyleID
View on GitHub
☆14Mar 12, 2023Updated 3 years ago
HuiGuanLab / ms-sl
View on GitHub
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆56Nov 4, 2024Updated last year
ailab-kyunghee / CM2_DVC
View on GitHub
[CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval
☆66Jun 19, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
WitchPuff / CRC-Diagnose
View on GitHub
A medical image recognition project powered by self-implemented ResNet and ViT models, utilizing PyTorch, with a user-friendly web demo b…
☆14Feb 26, 2024Updated 2 years ago
noagarcia / awesome-vqa-pytorch
View on GitHub
List of PyTorch repositories for visual question answering
☆15Jul 4, 2019Updated 7 years ago
duggalrahul / MICCAI17_EndoVis_RoboSeg
View on GitHub
My submission for the Robotic Instrument Segmentation Sub-Challenge held in conjunction with MICCAI 2017.
☆14Sep 8, 2017Updated 8 years ago
iworldtong / TALL.pytorch
View on GitHub
PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."
☆14Apr 20, 2019Updated 7 years ago
LX-doctorAI1 / DeltaNet
View on GitHub
☆18Nov 11, 2022Updated 3 years ago
glober-vaibhav / go-pub-sub
View on GitHub
Implementing a simple pub/sub design pattern in Go
☆10Jan 9, 2023Updated 3 years ago
yawenzeng / Awesome-Cross-Modal-Video-Moment-Retrieval
View on GitHub
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
☆265Aug 26, 2023Updated 2 years ago