LuoweiZhou/YouCook2-Leaderboard

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LuoweiZhou/YouCook2-Leaderboard)

LuoweiZhou / YouCook2-Leaderboard

A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.

☆41

Alternatives and similar repositories for YouCook2-Leaderboard

Users that are interested in YouCook2-Leaderboard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MichiganCOG / Video-Grounding-from-Text
View on GitHub
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆47Jun 22, 2024Updated 2 years ago
google-research-datasets / Video-Timeline-Tags-ViTT
View on GitHub
A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…
☆30Jan 15, 2022Updated 4 years ago
VALUE-Leaderboard / DataRelease
View on GitHub
Data Release for VALUE Benchmark
☆30Feb 16, 2022Updated 4 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
antoine77340 / MIL-NCE_HowTo100M
View on GitHub
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221Jul 5, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ranjaykrishna / densevid_eval
View on GitHub
Evaluation code for Dense-Captioning Events in Videos
☆130Jun 11, 2019Updated 7 years ago
LuoweiZhou / ProcNets-YouCook2
View on GitHub
Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"
☆34Jan 6, 2019Updated 7 years ago
ShiYaya / emscore
View on GitHub
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
DmZhukov / CrossTask
View on GitHub
☆97Feb 14, 2022Updated 4 years ago
zmykevin / UC2
View on GitHub
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Nov 9, 2021Updated 4 years ago
interactive-cookbook / ara
View on GitHub
Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021
☆10May 22, 2024Updated 2 years ago
DTaoo / DMC
View on GitHub
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15May 27, 2020Updated 6 years ago
MichiganNLP / vlog_action_recognition
View on GitHub
Identifying Visible Actions in Lifestyle Vlogs
☆15Aug 3, 2023Updated 2 years ago
jayleicn / mTVRetrieval
View on GitHub
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27Aug 20, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
View on GitHub
The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆12Aug 19, 2021Updated 4 years ago
MikeWangWZHL / VidIL
View on GitHub
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
☆117Sep 15, 2022Updated 3 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
zinengtang / DeCEMBERT
View on GitHub
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Jan 12, 2023Updated 3 years ago
ttengwang / dense-video-captioning-pytorch
View on GitHub
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
☆75Aug 25, 2021Updated 4 years ago
frankxu2004 / cooking-procedural-extraction
View on GitHub
☆19May 2, 2020Updated 6 years ago
cvlab-columbia / expert
View on GitHub
Code for Learning to Learn Language from Narrated Video
☆33Oct 3, 2023Updated 2 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
ruotianluo / refexp-comprehension
View on GitHub
Referring expression comprehension on ReferIt(RefClef)
☆10Nov 28, 2016Updated 9 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
tridivb / slowfast_feature_extractor
View on GitHub
Feature Extractor module for videos using the PySlowFast framework
☆80Apr 22, 2021Updated 5 years ago
salesforce / densecap
View on GitHub
☆191Jun 16, 2025Updated last year
jamespark3922 / adv-inf
View on GitHub
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Jul 17, 2019Updated 7 years ago
antoine77340 / S3D_HowTo100M
View on GitHub
S3D Text-Video model trained on HowTo100M using MIL-NCE
☆200Jul 3, 2020Updated 6 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
antoine77340 / video_feature_extractor
View on GitHub
Easy to use video deep features extractor
☆322Jul 5, 2020Updated 6 years ago
ramakanth-pasunuru / CAS-MAS
View on GitHub
Code for paper "Continual and Multi-Task Architecture Search (ACL 2019)"
☆41Jul 8, 2019Updated 7 years ago
simon-ging / coot-videotext
View on GitHub
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291Sep 6, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
showlab / Region_Learner
View on GitHub
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆43Jul 15, 2022Updated 4 years ago
antoine77340 / howto100m
View on GitHub
Code for the HowTo100M paper
☆303Mar 10, 2020Updated 6 years ago
ych133 / How2R-and-How2QA
View on GitHub
A video retrieval dataset How2R and a video QA dataset How2QA
☆24Oct 15, 2020Updated 5 years ago
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
linjieli222 / HERO_Video_Feature_Extractor
View on GitHub
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆118Jun 9, 2021Updated 5 years ago
linjieli222 / HERO
View on GitHub
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235Sep 16, 2021Updated 4 years ago