COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark
☆14Aug 22, 2024Updated last year
Alternatives and similar repositories for com_kitchens
Users who are interested in com_kitchens are comparing it to the repositories listed below
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆24Mar 20, 2024Updated last year
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆54Apr 15, 2024Updated last year
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆32Jun 9, 2025Updated 8 months ago
- ☆27Jul 18, 2025Updated 7 months ago
- The project is intended to demonstrate Lane tracking & detection on Qualcomm’s Robotics Platform RB5. YOLOP is the architecture used to i…☆10Aug 22, 2023Updated 2 years ago
- HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion☆12Jul 6, 2024Updated last year
- C3P_code☆11Sep 30, 2022Updated 3 years ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆36Feb 21, 2026Updated last week
- Official implementation of "Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Se…☆14Feb 6, 2022Updated 4 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated last year
- RDS message logger using a Silicon Labs Si470x chip connected to a Raspberry Pi☆11Apr 18, 2015Updated 10 years ago
- Image classification with a front-end interface. Example classification use: Chinese herbal medicine. Included networks: mobilenetv2, resnet, v…☆11Oct 11, 2023Updated 2 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- ☆11Aug 5, 2024Updated last year
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated 3 weeks ago
- Highly configurable simulation made using ns3 to compare two of the oldest TCP variants, Tahoe and Reno.☆11Feb 15, 2023Updated 3 years ago
- One-hot code for deep learning: one-hot encoding and decoding☆12Aug 17, 2019Updated 6 years ago
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'☆10Mar 11, 2024Updated last year
- ☆10Apr 26, 2023Updated 2 years ago
- Implementation of GAT with batching☆10Nov 28, 2020Updated 5 years ago
- Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation☆10Apr 14, 2025Updated 10 months ago
- ☆14Feb 18, 2022Updated 4 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆45Apr 9, 2025Updated 10 months ago
- ☆23Jun 12, 2025Updated 8 months ago
- Segment graph convolutional neural network for relation classification. Paper in JAMIA.☆10May 13, 2019Updated 6 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆24Aug 8, 2025Updated 6 months ago
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Feb 10, 2026Updated 3 weeks ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆10Aug 3, 2023Updated 2 years ago
- 🍽️ Get nutrition details from pictures of food.☆13May 3, 2025Updated 10 months ago
- Jupyter notebook showing off how to implement some simple variations of the Quantum random walk using the Qiskit library☆15Sep 7, 2024Updated last year
- Code for TKDE paper "Learning Relation Prototype from Unlabeled Texts for Long-tail Relation Extraction"☆10Feb 19, 2024Updated 2 years ago
- [CVPR2021] VaB-AL: Incorporating Class Imbalance and Difficulty with Variational Bayes for Active Learning☆14Oct 3, 2021Updated 4 years ago
- ☆15Oct 8, 2024Updated last year
- ☆12Sep 28, 2021Updated 4 years ago
- This is a Python demo of SGM with a stereo camera.☆13Feb 8, 2020Updated 6 years ago