antoine77340/S3D_HowTo100M

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/antoine77340/S3D_HowTo100M)

antoine77340 / S3D_HowTo100M

S3D Text-Video model trained on HowTo100M using MIL-NCE

☆200

Alternatives and similar repositories for S3D_HowTo100M

Users that are interested in S3D_HowTo100M are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

antoine77340 / MIL-NCE_HowTo100M
View on GitHub
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221Jul 5, 2022Updated 4 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆303Mar 10, 2020Updated 6 years ago
antoine77340 / video_feature_extractor
View on GitHub
Easy to use video deep features extractor
☆322Jul 5, 2020Updated 6 years ago
zinengtang / DeCEMBERT
View on GitHub
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Jan 12, 2023Updated 3 years ago
MichiganCOG / Video-Grounding-from-Text
View on GitHub
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆47Jun 22, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
simon-ging / coot-videotext
View on GitHub
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291Sep 6, 2022Updated 3 years ago
ArrowLuo / VideoFeatureExtractor
View on GitHub
Video Feature Extractor for S3D-HowTo100M
☆29Apr 30, 2021Updated 5 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
VALUE-Leaderboard / DataRelease
View on GitHub
Data Release for VALUE Benchmark
☆30Feb 16, 2022Updated 4 years ago
MichiganNLP / vlog_action_recognition
View on GitHub
Identifying Visible Actions in Lifestyle Vlogs
☆15Aug 3, 2023Updated 2 years ago
TengdaHan / TemporalAlignNet
View on GitHub
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122Oct 9, 2023Updated 2 years ago
jshi31 / NAFAE
View on GitHub
Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…
☆30Jun 29, 2020Updated 6 years ago
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377May 19, 2022Updated 4 years ago
jayleicn / singularity
View on GitHub
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆136May 5, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
albanie / collaborative-experts
View on GitHub
Video embeddings for retrieval with natural language queries
☆344Feb 15, 2023Updated 3 years ago
moabitcoin / ig65m-pytorch
View on GitHub
PyTorch 3D video classification models pre-trained on 65 million Instagram videos
☆265Dec 7, 2019Updated 6 years ago
frankxu2004 / cooking-procedural-extraction
View on GitHub
☆19May 2, 2020Updated 6 years ago
cshizhe / hgr_v2t
View on GitHub
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Jun 12, 2020Updated 6 years ago
DmZhukov / CrossTask
View on GitHub
☆97Feb 14, 2022Updated 4 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
linjieli222 / HERO
View on GitHub
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235Sep 16, 2021Updated 4 years ago
TengdaHan / CoCLR
View on GitHub
[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
☆288Oct 10, 2021Updated 4 years ago
antoine77340 / Mixture-of-Embedding-Experts
View on GitHub
Mixture-of-Embeddings-Experts
☆122Jul 21, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rxtan2 / video-grounding-narrations
View on GitHub
☆12Mar 12, 2023Updated 3 years ago
gabeur / mmt
View on GitHub
Multi-Modal Transformer for Video Retrieval
☆265Oct 9, 2024Updated last year
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
jayleicn / TVRetrieval
View on GitHub
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆163May 28, 2024Updated 2 years ago
yytzsy / SCDM
View on GitHub
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Sep 7, 2021Updated 4 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
LuoweiZhou / YouCook2-Leaderboard
View on GitHub
A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.
☆41Jun 29, 2022Updated 4 years ago
ych133 / How2R-and-How2QA
View on GitHub
A video retrieval dataset How2R and a video QA dataset How2QA
☆24Oct 15, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
kylemin / S3D
View on GitHub
Release of the pretrained S3D Network in PyTorch (ECCV 2018)
☆138Jul 20, 2023Updated 3 years ago
facebookresearch / grounded-video-description
View on GitHub
Video Grounding and Captioning
☆332Oct 12, 2021Updated 4 years ago
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
LisaAnne / LocalizingMoments
View on GitHub
Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
☆198Oct 31, 2020Updated 5 years ago
rowanz / merlot
View on GitHub
MERLOT: Multimodal Neural Script Knowledge Models
☆226Mar 15, 2022Updated 4 years ago
klauscc / VindLU
View on GitHub
☆109Dec 23, 2022Updated 3 years ago