iejMac/video2dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iejMac/video2dataset)

iejMac / video2dataset

Easily create large video dataset from video urls

☆662

Alternatives and similar repositories for video2dataset

Users that are interested in video2dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iejMac / clip-video-encode
View on GitHub
Easily compute clip embeddings from video frames
☆149Oct 31, 2023Updated 2 years ago
m-bain / webvid
View on GitHub
Large-scale text-video dataset. 10 million captioned short videos.
☆686Aug 14, 2024Updated last year
snap-research / Panda-70M
View on GitHub
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
☆700Oct 25, 2024Updated last year
rom1504 / img2dataset
View on GitHub
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
☆4,438Oct 19, 2025Updated 9 months ago
iejMac / video2numpy
View on GitHub
Optimized library for large-scale extraction of frames and audio from video.
☆204Sep 11, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
microsoft / XPretrain
View on GitHub
Multi-modality pre-training
☆511Mar 27, 2026Updated 4 months ago
rom1504 / python-template
View on GitHub
Simple python template
☆44Apr 25, 2024Updated 2 years ago
OpenGVLab / InternVideo
View on GitHub
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
☆2,342Jul 2, 2026Updated 3 weeks ago
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
mlfoundations / open_flamingo
View on GitHub
An open-source framework for training large multimodal models.
☆4,116Aug 31, 2024Updated last year
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆376May 19, 2022Updated 4 years ago
NJU-PCALab / OpenVid-1M
View on GitHub
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
☆452May 30, 2025Updated last year
mira-space / MiraData
View on GitHub
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
☆527Sep 2, 2024Updated last year
webdataset / webdataset
View on GitHub
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
☆3,149Feb 9, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Vchitect / LaVie
View on GitHub
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆952Nov 13, 2024Updated last year
dmlc / decord
View on GitHub
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆2,507Jul 17, 2024Updated 2 years ago
AILab-CVC / VideoCrafter
View on GitHub
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
☆5,067Jan 9, 2026Updated 6 months ago
DAMO-NLP-SG / Video-LLaMA
View on GitHub
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
☆3,142Jun 4, 2024Updated 2 years ago
AILab-CVC / FreeNoise
View on GitHub
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆428Aug 25, 2025Updated 11 months ago
LAION-AI / laion50BU
View on GitHub
Un-*** 50 billions multimodality dataset
☆24Sep 14, 2022Updated 3 years ago
xiaobai1217 / Awesome-Video-Datasets
View on GitHub
Video datasets
☆1,657Mar 8, 2023Updated 3 years ago
google-research / magvit
View on GitHub
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002Jan 17, 2024Updated 2 years ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,031Jul 17, 2026Updated last week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Vchitect / VBench
View on GitHub
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,712Mar 23, 2026Updated 4 months ago
evalcrafter / EvalCrafter
View on GitHub
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
☆193Oct 3, 2024Updated last year
LAION-AI / temporal-embedding-aggregation
View on GitHub
Aggregating embeddings over time
☆32Jan 19, 2023Updated 3 years ago
rom1504 / cc2dataset
View on GitHub
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
☆321Dec 9, 2023Updated 2 years ago
mlfoundations / datacomp
View on GitHub
DataComp: In search of the next generation of multimodal datasets
☆787Apr 28, 2025Updated last year
showlab / Awesome-Video-Diffusion
View on GitHub
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,734Updated this week
AILab-CVC / Make-Your-Video
View on GitHub
[IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance
☆196Feb 24, 2024Updated 2 years ago
NVIDIA / Cosmos-Tokenizer
View on GitHub
A suite of image and video neural tokenizers
☆1,732Feb 11, 2025Updated last year
LAION-AI / laion-dreams
View on GitHub
Aim for the moon. If you miss, you may hit a star.
☆168Feb 14, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rom1504 / clip-retrieval
View on GitHub
Easily compute clip embeddings and build a clip retrieval system with them
☆2,789Mar 28, 2026Updated 4 months ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,693May 31, 2024Updated 2 years ago
showlab / MotionDirector
View on GitHub
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
☆1,049Aug 21, 2024Updated last year
Vchitect / Latte
View on GitHub
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,948Oct 30, 2025Updated 8 months ago
huggingface / open-muse
View on GitHub
Open reproduction of MUSE for fast text2image generation.
☆358Jun 1, 2024Updated 2 years ago
LAION-AI / video-clip
View on GitHub
Let's make a video clip
☆97Jul 29, 2022Updated 3 years ago
daooshee / HD-VG-130M
View on GitHub
The HD-VG-130M Dataset
☆126Apr 8, 2024Updated 2 years ago