epic-kitchens / VISOR-HOS
Code for recreating the HoS benchmark of VISOR
☆19 · Updated last year
Related projects
Alternatives and complementary repositories for VISOR-HOS
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work… ☆23 · Updated 4 months ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos ☆28 · Updated 8 months ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding… ☆20 · Updated last year
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers ☆30 · Updated 6 months ago
- [CVPR 2023] Detecting Human-Object Contact in Images ☆49 · Updated last year
- Code for paper Background Prompting for Improved Object Depth ☆29 · Updated last year
- [NeurIPS 2022 Spotlight] Hand-Object Interaction Image Generation ☆30 · Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆64 · Updated 2 years ago
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight) ☆42 · Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024 ☆44 · Updated 8 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)" ☆15 · Updated last year
- 🕊 DOVE: Learning Deformable 3D Objects by Watching Videos (IJCV 2023) ☆22 · Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ☆20 · Updated 7 months ago
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer ☆45 · Updated 11 months ago
- This is the project page of ShowRoom3D ☆25 · Updated 11 months ago
- Visualisation of VISOR Segmentations with Annotations and Relations ☆21 · Updated 2 years ago
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation ☆13 · Updated 9 months ago
- [NeurIPS 2024] Multi-Human Dataset for Close Interactions. ☆15 · Updated last week
- [ICCV 2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆75 · Updated last year
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021. ☆42 · Updated 6 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆66 · Updated last week
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning" ☆23 · Updated last year