COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark
☆15Aug 22, 2024Updated last year
Alternatives and similar repositories for com_kitchens
Users that are interested in com_kitchens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆26Mar 20, 2024Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆58Apr 15, 2024Updated 2 years ago
- A chrome extension for donwloading arXiv papers into the google drive.☆17Aug 4, 2024Updated last year
- ☆28Jul 18, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Oct 13, 2023Updated 2 years ago
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- Code for "Divergence Optimization for Noisy Universal Domain Adaptation"☆11Jun 12, 2021Updated 5 years ago
- ⚓ Minato: A Unified File I/O Library for Python☆13Dec 6, 2024Updated last year
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆32Jun 9, 2025Updated last year
- Code for the ICRA 2024 Cook2LTL paper on translating free-form cooking recipes to Linear Temporal Logic (LTL) formulae for robot task pla…☆22Oct 18, 2024Updated last year
- Cookbook for the Text Analysis Web API provided by Yahoo! DEVELOPER NETWORK.☆25Jun 2, 2026Updated last week
- Rotated Word Vector Representations and their Interpretability (EMNLP 2017)☆18Jul 13, 2019Updated 6 years ago
- Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"☆17Apr 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tomography operators for Pytorch☆18Jul 31, 2024Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆45Dec 7, 2024Updated last year
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 4 months ago
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆36Apr 8, 2025Updated last year
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated 3 months ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆25Aug 8, 2025Updated 10 months ago
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Apr 5, 2024Updated 2 years ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- companion code for "Learning to substitute Ingredients in Recipes"☆28Aug 17, 2023Updated 2 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆18Apr 2, 2025Updated last year
- [CVPR'25] SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction☆24Jul 28, 2025Updated 10 months ago
- DepthSense DS325/DS311, Kinect V1, Kinect V2, RealSense D415/D435 からカラーとデプスを取得するサンプル☆12Apr 13, 2026Updated 2 months ago
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆20Mar 19, 2025Updated last year
- Official Code for CVPR2025 Paper: LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion☆31May 4, 2026Updated last month
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆41Feb 24, 2025Updated last year
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆46Apr 9, 2025Updated last year
- ☆28Jun 12, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings".☆12Nov 1, 2021Updated 4 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- [ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation