epic-kitchens / VISOR-HOSLinks
Code for recreating the HoS benchmark of VISOR
☆22Updated 2 years ago
Alternatives and similar repositories for VISOR-HOS
Users that are interested in VISOR-HOS are comparing it to the libraries listed below
Sorting:
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆77Updated 2 years ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆31Updated last year
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆25Updated last year
- ☆93Updated 3 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Updated last year
- [NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation☆33Updated 2 years ago
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers☆52Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 3 years ago
- [ECCV 2022] PressureVision: Estimating Hand Pressure from a Single RGB Image☆49Updated 2 years ago
- ☆29Updated 7 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Updated 2 years ago
- DAGM GCPR 2023 Paper: HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture☆26Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Updated 10 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆47Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated last year
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"☆19Updated last year
- Code for paper Background Prompting for Improved Object Depth☆29Updated 2 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆20Updated last year
- [CVPR 2023] Detecting Human-Object Contact in Images☆56Updated 2 years ago
- Code for "Recognizing Scenes from Novel Viewpoints"☆29Updated 3 years ago
- ☆73Updated 3 years ago
- ☆27Updated 2 years ago
- [ICLR 2023 spotlight] Official PyTorch implementation of the paper "Stochastic Multi-Person 3D Motion Forecasting"☆53Updated 2 years ago
- Test-Time Training on Video Streams☆64Updated 2 years ago
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.☆43Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆51Updated last year
- ☆37Updated 5 months ago
- ☆50Updated 6 months ago
- ☆33Updated 3 years ago
- ☆19Updated 2 years ago