LandyGuo/Download_HowTo100M

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LandyGuo/Download_HowTo100M)

LandyGuo / Download_HowTo100M

code for downloading videos from HowTo100M dataset

☆18

Alternatives and similar repositories for Download_HowTo100M

Users that are interested in Download_HowTo100M are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dengandong / GroundMoRe
View on GitHub
☆18May 18, 2026Updated 2 months ago
ShareLab-SII / CaTok
View on GitHub
[CVPR-26] Official repository of "CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization"
☆19Mar 9, 2026Updated 4 months ago
Jayce1kk / SpaceVLLM
View on GitHub
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
☆17May 8, 2025Updated last year
showlab / Region_Learner
View on GitHub
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆43Jul 15, 2022Updated 4 years ago
jaeyun95 / pre-trained-vlk-model
View on GitHub
pre-trained vision and language model summary
☆12Apr 20, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tsujuifu / pytorch_violet
View on GitHub
A PyTorch implementation of VIOLET
☆138Dec 17, 2023Updated 2 years ago
HuiZhang0812 / WeEdit
View on GitHub
A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing
☆20Mar 13, 2026Updated 4 months ago
rentainhe / TRAR-VQA
View on GitHub
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
☆68Oct 11, 2021Updated 4 years ago
adapter-hub / xGQA
View on GitHub
☆25Mar 4, 2022Updated 4 years ago
jayleicn / mTVRetrieval
View on GitHub
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27Aug 20, 2022Updated 3 years ago
antoine77340 / MIL-NCE_HowTo100M
View on GitHub
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221Jul 5, 2022Updated 4 years ago
derkbreeze / AwesomeActionSegmentation
View on GitHub
☆33Jun 19, 2026Updated last month
tzhhhh123 / HC-STVG
View on GitHub
The HC-STVG Dataset
☆65Apr 12, 2023Updated 3 years ago
ShareLab-SII / FluxMem
View on GitHub
[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding
☆73Mar 16, 2026Updated 4 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
JPShi12 / VideoLoom
View on GitHub
[ICML 2026] VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding
☆27Jul 3, 2026Updated 2 weeks ago
olga-zats / GTDA
View on GitHub
[ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation
☆24May 29, 2025Updated last year
VALUE-Leaderboard / DataRelease
View on GitHub
Data Release for VALUE Benchmark
☆30Feb 16, 2022Updated 4 years ago
inst-it / inst-it
View on GitHub
[NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…
☆40Feb 20, 2025Updated last year
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆303Mar 10, 2020Updated 6 years ago
MengLcool / SliMM
View on GitHub
☆25Dec 26, 2024Updated last year
Feng-Hong / DivBS
View on GitHub
[ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"
☆10Jul 30, 2024Updated last year
ItzikMalkiel / MTAdam
View on GitHub
MTAdam: Automatic Balancing of Multiple Training Loss Terms
☆37Nov 21, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
BriansIDP / AudioVisualLLM
View on GitHub
☆19May 19, 2024Updated 2 years ago
ruiyan1995 / Interactive_Fusion_for_CAR
View on GitHub
☆16Jan 6, 2025Updated last year
xmu-xiaoma666 / ImageCaptionMetrics
View on GitHub
This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…
☆18Apr 4, 2021Updated 5 years ago
AlenUbuntu / Awesome-Vision-and-Language-PreTrain-Papers
View on GitHub
☆14Dec 25, 2020Updated 5 years ago
ImperialNLP / BertGen
View on GitHub
Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)
☆11Sep 17, 2023Updated 2 years ago
luo3300612 / Transformer-Captioning
View on GitHub
Optimized code based on M2 for faster image captioning training
☆21Nov 18, 2022Updated 3 years ago
ShareLab-SII / UniAR
View on GitHub
[ICML 2026] The official implementation of paper "Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key …
☆46Jul 13, 2026Updated last week
nailwatts / FNIN
View on GitHub
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
☆13Jan 22, 2025Updated last year
fujiso / SODA
View on GitHub
SODA: Story Oriented Dense Video Captioning Evaluation Framework
☆14May 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GX77 / LCVSL
View on GitHub
☆14Sep 28, 2023Updated 2 years ago
JinchaoLove / CUHK-PhD-Thesis-Template
View on GitHub
Latex template for CUHK PhD Thesis
☆14Jun 29, 2025Updated last year
starmemda / CAMoE
View on GitHub
☆101Sep 27, 2021Updated 4 years ago
ZhouJiahui-dlut / OpEvFake
View on GitHub
Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024.
☆15Apr 25, 2025Updated last year
bytedance / UniVR
View on GitHub
☆17Updated this week
marcoancona / LPDN
View on GitHub
Feedforward implementation of Lightweight Probabilistic Deep Networks for Keras and Tensorflow
☆14Jul 1, 2019Updated 7 years ago
wllmzhu / G-VUE
View on GitHub
General-purpose Visual Understanding Evaluation
☆20Dec 21, 2023Updated 2 years ago