facebookresearch/ProcedureVRL

[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"

☆56

Alternatives and similar repositories for ProcedureVRL

Users that are interested in ProcedureVRL are comparing it to the libraries listed below.

salesforce / paprika
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆50 · Jun 2, 2026 · Updated last month
facebookresearch / TaskGraph
Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" (arXiv, 2023)
☆15 · Apr 1, 2024 · Updated 2 years ago
YiwuZhong / Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆99 · Aug 20, 2024 · Updated last year
soCzech / GenHowTo
Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024
☆54 · Mar 3, 2024 · Updated 2 years ago
facebookresearch / htstep
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆26 · Mar 20, 2024 · Updated 2 years ago
jhuang81 / weak-sup-visual-grounding
The official implementation of the CVPR 2021 paper "Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation"
☆12 · Oct 15, 2021 · Updated 4 years ago
facebookresearch / HierVL
[CVPR 2023] HierVL: Learning Hierarchical Video-Language Embeddings
☆46 · Aug 14, 2023 · Updated 2 years ago
lbaermann / qaego4d
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆31 · Aug 28, 2023 · Updated 2 years ago
facebookresearch / video-distant-supervision
This is an official PyTorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…
☆43 · Feb 21, 2023 · Updated 3 years ago
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆61 · Apr 15, 2024 · Updated 2 years ago
Sid2697 / EgoProceL-egocentric-procedure-learning
Code implementation for our ECCV 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"
☆35 · Feb 5, 2024 · Updated 2 years ago
abrarmajeedi / rica2_aqa
Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)
☆15 · Nov 9, 2025 · Updated 8 months ago
zhuoyan-xu / Foundation-Model_Multitask
☆17 · Mar 14, 2024 · Updated 2 years ago
WenliangGuo / SCHEMA
[ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos
☆20 · Aug 21, 2025 · Updated 11 months ago
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV 2023]
☆110 · Jul 2, 2024 · Updated 2 years ago
LaVi-Lab / Rethink_CoT_Video
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21 · Dec 14, 2025 · Updated 7 months ago
facebookresearch / stepdiff
Data release for Step Differences in Instructional Video (CVPR 2024)
☆15 · Jun 19, 2024 · Updated 2 years ago
TencentARC / FLM
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31 · May 15, 2023 · Updated 3 years ago
DanDoge / Palm
Team Doggeee's solution to the Ego4D LTA challenge @ CVPRW'23
☆14 · Nov 4, 2023 · Updated 2 years ago
medhini / Instructional-Video-Summarization
Code for the paper "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" (ECCV 2022)
☆39 · Feb 17, 2023 · Updated 3 years ago
zjuchenlong / WSAG
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14 · Nov 25, 2023 · Updated 2 years ago
Ravindu-Yasas-Nagasinghe / KEPP
[CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos
☆12 · Sep 24, 2024 · Updated last year
YujieLu10 / TIP
Multimodal-Procedural-Planning
☆92 · Jun 1, 2023 · Updated 3 years ago
ttlmh / Bridge-Prompt
[CVPR 2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
☆102 · Oct 30, 2022 · Updated 3 years ago
yuleiniu / introd
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13 · Dec 7, 2021 · Updated 4 years ago
gqa-ood / GQA-OOD
GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out-of-distribution) settings.
☆33 · Mar 1, 2021 · Updated 5 years ago
facebookresearch / vq2d_cvpr
This repo contains the code for the recipe of the winning entry to the Ego4D VQ2D challenge at CVPR 2022.
☆42 · Mar 7, 2023 · Updated 3 years ago
Buzz-Beater / EgoTaskQA
Code for the NeurIPS 2022 Datasets and Benchmarks paper "EgoTaskQA: Understanding Human Tasks in Egocentric Videos".
☆45 · Apr 17, 2023 · Updated 3 years ago
dragonlzm / PAVE
This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR 2025)
☆27 · Sep 6, 2025 · Updated 10 months ago
fmu2 / snag_release
Official implementation of SnAG (CVPR 2024)
☆59 · Apr 26, 2025 · Updated last year
PKU-VaLuE-Lab / m3eval
Official code for M3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks
☆21 · Jun 4, 2026 · Updated last month
ayiyayi / EgoExoBench
☆15 · Nov 13, 2025 · Updated 8 months ago
fmu2 / gradfeat20
Gradients as Features for Deep Representation Learning
☆43 · Mar 8, 2020 · Updated 6 years ago
LaVi-Lab / AIM
[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
☆65 · Oct 9, 2025 · Updated 9 months ago
brown-palm / AntGPT
Official code implementation of the paper "AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?"
☆31 · Sep 23, 2024 · Updated last year
TencentARC / TaCA
Official code for the paper "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16 · Jun 20, 2023 · Updated 3 years ago
Nmegha2601 / anticipatr
☆12 · Apr 6, 2023 · Updated 3 years ago
umd-huang-lab / Mementos
☆32 · Feb 8, 2024 · Updated 2 years ago
soCzech / MultiTaskObjectStates
Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI
☆11 · Mar 3, 2024 · Updated 2 years ago