Chiaraplizz / ARGO1M-What-can-a-cookLinks
☆10Updated 2 years ago
Alternatives and similar repositories for ARGO1M-What-can-a-cook
Users that are interested in ARGO1M-What-can-a-cook are comparing it to the libraries listed below
Sorting:
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Updated 3 years ago
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆19Updated 3 years ago
- Codebase for "Revisiting spatio-temporal layouts for compositional action recognition" (Oral at BMVC 2021).☆25Updated 3 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13Updated 2 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆118Updated 2 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆99Updated 3 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Updated 3 years ago
- CVPR2022☆22Updated 3 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆50Updated last year
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Updated last year
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Updated 3 years ago
- Temporal Action Localization Visualization Tool (TALVT) is a Javascript based simple visualization tool to visualize the outcomes of the …☆29Updated 5 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 3 years ago
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"☆31Updated last year
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆34Updated 4 years ago
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆190Updated 2 years ago
- Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection☆43Updated last year
- ☆52Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆110Updated 2 years ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Updated 2 years ago
- BEAR: a new BEnchmark on video Action Recognition☆45Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆49Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Updated 2 years ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024☆68Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated 2 years ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆158Updated 3 years ago
- ☆26Updated 2 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆72Updated 3 years ago
- Download scripts for EPIC-KITCHENS☆152Updated 4 months ago
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆58Updated 3 years ago