hq-King / SeqAffordLinks
CVPR 2025
☆35Updated 7 months ago
Alternatives and similar repositories for SeqAfford
Users that are interested in SeqAfford are comparing it to the libraries listed below
Sorting:
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆30Updated 3 months ago
- ☆40Updated last year
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos☆143Updated last week
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 8 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆30Updated 3 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆200Updated last month
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video'☆17Updated 8 months ago
- [ICLR'24] GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion☆114Updated last year
- HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025☆47Updated 7 months ago
- ☆64Updated 4 months ago
- [CVPR2025] EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild☆90Updated 3 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆182Updated 6 months ago
- (Incomplete version) This is an implementation of affordancellm.☆16Updated last year
- ☆89Updated 6 months ago
- Official Repository for ICCV 2025 paper DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models☆75Updated 3 months ago
- Official PyTorch implementation of EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views☆29Updated last year
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆139Updated 3 weeks ago
- ☆19Updated this week
- Code for Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation☆24Updated 7 months ago
- Official implementation of ICCV 2025 paper "TACO: Taming Diffusion for in-the-wild Video Amodal Completion"☆27Updated 5 months ago
- Official repository of "TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding".☆60Updated last week
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆45Updated last month
- [NeurIPS 2025 Spotlight] MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning☆67Updated 2 months ago
- ☆30Updated 11 months ago
- ☆39Updated last year
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆47Updated 11 months ago
- Official repository for gathering data of Revisit Human-Scene Interaction via Space Occupancy (ECCV 2024).☆28Updated last year
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆39Updated 2 months ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆87Updated 7 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆42Updated 2 months ago