[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆39Dec 2, 2025Updated 3 months ago
Alternatives and similar repositories for EgoScaler
Users that are interested in EgoScaler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2025] DyWA:Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation☆76Sep 23, 2025Updated 6 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆64Aug 4, 2025Updated 7 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆41Nov 21, 2025Updated 4 months ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated 11 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆31Mar 5, 2026Updated 3 weeks ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆42Jan 29, 2026Updated last month
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆181Jun 20, 2025Updated 9 months ago
- [CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model☆31Jun 26, 2025Updated 9 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆47Jun 20, 2025Updated 9 months ago
- ☆11Jul 19, 2023Updated 2 years ago
- ☆135Aug 27, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An open source Multi-View Latent Diffusion Model☆42Feb 23, 2026Updated last month
- ☆115Dec 4, 2025Updated 3 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 2 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆44Mar 18, 2026Updated last week
- ☆18Nov 4, 2024Updated last year
- Code for Stable Control Representations☆26Apr 5, 2025Updated 11 months ago
- Documentation and software tools for the Novel Sensors for Autonomous Vehicle Perception (NSAVP) dataset☆24Updated this week
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆43Jun 26, 2025Updated 9 months ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction☆85Oct 7, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repostory of the paper: Masked Scene Modeling (CVPR 2025)☆17Dec 13, 2025Updated 3 months ago
- [IROS 2025] Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction☆24Dec 2, 2025Updated 3 months ago
- ☆18May 7, 2025Updated 10 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆29Nov 21, 2025Updated 4 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆30Dec 9, 2025Updated 3 months ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆181Mar 10, 2026Updated 2 weeks ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated 2 weeks ago
- Code and data for UniEgoMotion (ICCV 2025)☆46Nov 11, 2025Updated 4 months ago
- Code for using the Grasp Affordance Reasoning dataset☆10Sep 17, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆34Jul 8, 2025Updated 8 months ago
- [CoRL 2025] UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆78Dec 18, 2025Updated 3 months ago
- [RA-L 2023] Active Implicit Object Reconstruction using Uncertainty-guided Next-Best-View Optimization☆13Aug 28, 2023Updated 2 years ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆108Jan 27, 2026Updated 2 months ago
- [ICCV 2025 Spotlight] DexVLG: Dexterous Vision-Language-Grasp Model at Scale☆50Jul 24, 2025Updated 8 months ago
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects☆64Sep 30, 2025Updated 5 months ago
- Official code for EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models☆107Jun 13, 2025Updated 9 months ago