BeingBeyond / Being-M0
☆14Updated this week
Alternatives and similar repositories for Being-M0
Users that are interested in Being-M0 are comparing it to the libraries listed below
Sorting:
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 6 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆43Updated last year
- ☆25Updated 2 years ago
- [CVPR'24] "AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents"☆122Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆64Updated last year
- Official implementation of the paper "PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios" (CVPR 2024).☆68Updated 10 months ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆31Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆86Updated 9 months ago
- [ICLR'24] GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion☆103Updated 9 months ago
- ☆16Updated last year
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆62Updated 5 months ago
- Official repository of "TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding".☆55Updated 3 weeks ago
- ☆80Updated 5 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆128Updated 6 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆44Updated last year
- AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose (ICCV 2023)☆76Updated last year
- ☆44Updated 2 years ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 10 months ago
- A Python package that provides evaluation and visualization tools for the HO-Cap dataset☆36Updated last month
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated 8 months ago
- ☆31Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆27Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction☆72Updated 2 months ago
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation☆43Updated last month
- ☆72Updated 8 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆64Updated 2 weeks ago
- HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos☆52Updated last month
- ☆75Updated last month
- Official Implementation of the Paper: Controllable Human-Object Interaction Synthesis (ECCV 2024 Oral))☆109Updated 3 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆21Updated last month