COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark
☆14Aug 22, 2024Updated last year
Alternatives and similar repositories for com_kitchens
Users that are interested in com_kitchens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆25Mar 20, 2024Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆55Apr 15, 2024Updated last year
- A chrome extension for donwloading arXiv papers into the google drive.☆17Aug 4, 2024Updated last year
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Oct 13, 2023Updated 2 years ago
- ☆27Jul 18, 2025Updated 8 months ago
- Code for "Divergence Optimization for Noisy Universal Domain Adaptation"☆11Jun 12, 2021Updated 4 years ago
- ⚓ Minato: A Unified File I/O Library for Python☆13Dec 6, 2024Updated last year
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆31Jun 9, 2025Updated 9 months ago
- Code for the ICRA 2024 Cook2LTL paper on translating free-form cooking recipes to Linear Temporal Logic (LTL) formulae for robot task pla…☆20Oct 18, 2024Updated last year
- Cookbook for the Text Analysis Web API provided by Yahoo! DEVELOPER NETWORK.☆23Mar 3, 2026Updated 3 weeks ago
- Rotated Word Vector Representations and their Interpretability (EMNLP 2017)☆18Jul 13, 2019Updated 6 years ago
- Tomography operators for Pytorch☆17Jul 31, 2024Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"☆16Apr 1, 2025Updated 11 months ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated 11 months ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆25Aug 8, 2025Updated 7 months ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 6 months ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated last month
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Apr 5, 2024Updated last year
- companion code for "Learning to substitute Ingredients in Recipes"☆28Aug 17, 2023Updated 2 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 11 months ago
- [CVPR'25] SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction☆20Jul 28, 2025Updated 7 months ago
- Official Code for CVPR2025 Paper: LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion☆29Jan 15, 2026Updated 2 months ago
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆18Mar 19, 2025Updated last year
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated last year
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆45Apr 9, 2025Updated 11 months ago
- ☆25Jun 12, 2025Updated 9 months ago
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings".☆12Nov 1, 2021Updated 4 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- [ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation☆23May 29, 2025Updated 9 months ago
- 日本食品標準成分表2020年版(八訂)をjsonにしたもの☆26Jan 21, 2021Updated 5 years ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated last year
- ☆23Aug 19, 2024Updated last year
- RDS message logger using a silicon labs si470x chip connected to a raspberry pi☆11Apr 18, 2015Updated 10 years ago
- A Web Extention; Enhance manaba+R☆10Apr 10, 2024Updated last year