COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark
☆15Aug 22, 2024Updated last year
Alternatives and similar repositories for com_kitchens
Users that are interested in com_kitchens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆26Mar 20, 2024Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆57Apr 15, 2024Updated 2 years ago
- A chrome extension for donwloading arXiv papers into the google drive.☆17Aug 4, 2024Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Oct 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Cookbook for the Text Analysis Web API provided by Yahoo! DEVELOPER NETWORK.☆24Mar 3, 2026Updated 2 months ago
- Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"☆16Apr 1, 2025Updated last year
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 3 months ago
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated last year
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated 3 months ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆25Aug 8, 2025Updated 9 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- DepthSense DS325/DS311, Kinect V1, Kinect V2, RealSense D415/D435 からカラーとデプスを取得するサンプル☆12Apr 13, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'25] SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction☆24Jul 28, 2025Updated 9 months ago
- Official Code for CVPR2025 Paper: LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion☆31May 4, 2026Updated 3 weeks ago
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆18Mar 19, 2025Updated last year
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆41Feb 24, 2025Updated last year
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆46Apr 9, 2025Updated last year
- ☆28Jun 12, 2025Updated 11 months ago
- Codebase for the EMNLP 2021 paper "HittER: Hierarchical Transformers for Knowledge Graph Embeddings".☆12Nov 1, 2021Updated 4 years ago
- ☆25Aug 19, 2024Updated last year
- RDS message logger using a silicon labs si470x chip connected to a raspberry pi☆11Apr 18, 2015Updated 11 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Highly configurable simulation made using ns3 to compare two of the oldest TCP variants, Tahoe and Reno.☆11Feb 15, 2023Updated 3 years ago
- ☆72Feb 1, 2026Updated 3 months ago
- Code for "TAG: Guidance-free Open-Vocabulary Semantic Segmentation"☆15Jul 13, 2024Updated last year
- An ambient sounds mixer to help you study, focus, increase concentration, calm your mind.☆10Jan 6, 2023Updated 3 years ago
- Code release for Adversarial Branch Architecture Search for Unsupervised Domain Adaptation☆13Mar 5, 2022Updated 4 years ago
- A tool for design pattern recognition on blockchain through static code analysis☆10Apr 13, 2026Updated last month
- [CVPR 2025] MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation☆25Jun 13, 2025Updated 11 months ago
- ☆29Jul 25, 2025Updated 9 months ago
- 🌈 Implementation of Neural Network based Named Entity Recognizer (Lample+, 2016) using Chainer.☆45Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆24Oct 31, 2025Updated 6 months ago
- Repository containing the code used for running the experiments of the Poincare ResNet paper☆29Aug 25, 2023Updated 2 years ago
- Re-implementation for ICCV23 "Social Diffusion: Long-term Multiple Human Motion Anticipation"☆24Oct 3, 2023Updated 2 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- ☆31Mar 2, 2023Updated 3 years ago
- Implements the loss used in A. Furnari, S. Battiato, G. M. Farinella (2018). Leveraging Uncertainty to Rethink Loss Functions and Evaluat…☆11May 22, 2019Updated 7 years ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆21Oct 28, 2024Updated last year