A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
☆39Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for awesome-video-text-datasets
Users that are interested in awesome-video-text-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆35Apr 17, 2025Updated 11 months ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 2 years ago
- Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Pe…☆133Oct 27, 2023Updated 2 years ago
- Official code for the ICCV23 paper: "Domain Generalization via Rationale Invariance"☆21Jan 18, 2025Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the Gumbel-Sigmoid distribution in PyTorch.☆21Jul 22, 2022Updated 3 years ago
- ☆22Dec 11, 2018Updated 7 years ago
- code for studying OpenAI's CLIP explainability☆39Jan 7, 2022Updated 4 years ago
- ☆11Jan 16, 2025Updated last year
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆28Jul 2, 2025Updated 8 months ago
- Zero-Shot Learning☆19Dec 9, 2019Updated 6 years ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- ☆80Nov 24, 2024Updated last year
- This is a Pytorch Implementation of the DASP algorithm from the paper "Explaining Deep Neural Networks with a Polynomial Time Algorithm f…☆11Jun 12, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆15Sep 28, 2023Updated 2 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- ☆15Aug 4, 2020Updated 5 years ago
- Spherical Parameterization of Genus-0 Surfaces☆10Mar 31, 2022Updated 3 years ago
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- ☆12May 26, 2023Updated 2 years ago
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- [NeurIPS 2025] Code for Low-Rank Head Avatar Personalization with Registers☆17Dec 9, 2025Updated 3 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- A conversational LoRA for OPT 2.7b☆10Apr 28, 2023Updated 2 years ago
- ☆34Mar 26, 2024Updated 2 years ago
- Simple python rasterizer tool implemented by OpenGL and C++☆15Nov 10, 2025Updated 4 months ago
- codewithgpu.com python client package☆20May 18, 2023Updated 2 years ago
- ☆14Mar 12, 2023Updated 3 years ago
- Managed L2D tool libs. (In Dev)☆12Apr 20, 2019Updated 6 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- [ICRA 2025] Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications☆18Mar 2, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- My submission for the Robotic Instrument Segmentation Sub-Challenge held in conjunction with MICCAI 2017.☆13Sep 8, 2017Updated 8 years ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆64Jun 19, 2024Updated last year
- AIST++ Dataset Webpage: https://google.github.io/aistplusplus_dataset☆17Sep 16, 2021Updated 4 years ago
- ☆15Nov 19, 2020Updated 5 years ago
- ☆18Nov 11, 2022Updated 3 years ago
- Generalized Method of Moments estimation☆13Mar 23, 2025Updated last year
- PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."☆14Apr 20, 2019Updated 6 years ago