☆19May 2, 2020Updated 5 years ago
Alternatives and similar repositories for cooking-procedural-extraction
Users that are interested in cooking-procedural-extraction are comparing it to the libraries listed below
Sorting:
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- ☆96Feb 14, 2022Updated 4 years ago
- Code for our ECCV 2018 paper "Affine Correspondences between Central Cameras for Rapid Relative Pose Estimation"☆14Dec 15, 2018Updated 7 years ago
- ☆18Jun 5, 2024Updated last year
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- Codes for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"☆17Jun 10, 2022Updated 3 years ago
- Detectron for image/video region feature extraction, inspired by Xinlei's repo☆22Nov 21, 2020Updated 5 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆48Jun 22, 2024Updated last year
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- Dataset generated by the methods in "What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision"☆21May 27, 2015Updated 10 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆219Jul 5, 2022Updated 3 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- Black-box Adversarial Attacks on Video Recognition Models. (VBAD)☆27Oct 28, 2019Updated 6 years ago
- ☆23Jan 10, 2019Updated 7 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- Code for the HowTo100M paper☆294Mar 10, 2020Updated 6 years ago
- Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)☆123Aug 17, 2022Updated 3 years ago
- Implementation of different Normalizing Flows, NF, Planar Flows, IAF, etc.☆30May 3, 2018Updated 7 years ago
- EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM☆33Mar 15, 2022Updated 3 years ago
- SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition, AAAI2020.☆14Dec 15, 2020Updated 5 years ago
- PyTorch implementation of "Detecting 32 Pedestrian Attributes for Autonomous Vehicles"☆33Oct 16, 2021Updated 4 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Official implementation of "Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Se…☆14Feb 6, 2022Updated 4 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- [ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset☆90Sep 6, 2023Updated 2 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- A static site generator for SiYuan Note (思源笔记) app☆14Oct 18, 2025Updated 4 months ago
- Feature Extraction Toolbox from CUHKÐZ&SIAT submission to ActivityNet 2016☆32Mar 31, 2019Updated 6 years ago
- Implementation of the Little Man Computer for learning Assembly programming in Julia☆12Jul 16, 2022Updated 3 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Jul 30, 2021Updated 4 years ago
- ☆42Jun 2, 2020Updated 5 years ago
- A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling (AAAI 2018)☆10Oct 23, 2020Updated 5 years ago
- Code for our paper in ACL 2017☆13Dec 14, 2017Updated 8 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago