[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆40 · Dec 2, 2025 · Updated 4 months ago
Alternatives and similar repositories for EgoScaler
Users that are interested in EgoScaler are comparing it to the libraries listed below.
- [ICCV 2025] DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation ☆77 · Sep 23, 2025 · Updated 6 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation ☆65 · Aug 4, 2025 · Updated 8 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆43 · Nov 21, 2025 · Updated 4 months ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted) ☆18 · Apr 11, 2025 · Updated last year
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆33 · Mar 5, 2026 · Updated last month
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation ☆42 · Jan 29, 2026 · Updated 2 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation ☆181 · Jun 20, 2025 · Updated 9 months ago
- [CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model ☆31 · Jun 26, 2025 · Updated 9 months ago
- VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders ☆87 · Mar 25, 2026 · Updated 3 weeks ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs. ☆12 · Apr 24, 2025 · Updated 11 months ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision ☆17 · Feb 3, 2025 · Updated last year
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆44 · Aug 9, 2025 · Updated 8 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation ☆46 · Updated this week
- ☆11 · Jul 19, 2023 · Updated 2 years ago
- ☆138 · Aug 27, 2025 · Updated 7 months ago
- An open source Multi-View Latent Diffusion Model ☆44 · Feb 23, 2026 · Updated last month
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts. ☆20 · Jan 6, 2026 · Updated 3 months ago
- ☆118 · Dec 4, 2025 · Updated 4 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning ☆45 · Mar 18, 2026 · Updated 3 weeks ago
- ☆18 · Nov 4, 2024 · Updated last year
- Code for Stable Control Representations ☆26 · Apr 5, 2025 · Updated last year
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model ☆42 · Jun 26, 2025 · Updated 9 months ago
- Documentation and software tools for the Novel Sensors for Autonomous Vehicle Perception (NSAVP) dataset ☆24 · Mar 20, 2026 · Updated 3 weeks ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction ☆84 · Oct 7, 2025 · Updated 6 months ago
- Official repository of the paper: Masked Scene Modeling (CVPR 2025) ☆17 · Dec 13, 2025 · Updated 4 months ago
- ☆18 · May 7, 2025 · Updated 11 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout ☆30 · Nov 21, 2025 · Updated 4 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… ☆30 · Dec 9, 2025 · Updated 4 months ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory ☆181 · Mar 10, 2026 · Updated last month
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies" ☆27 · Mar 9, 2026 · Updated last month
- Code and data for UniEgoMotion (ICCV 2025) ☆48 · Nov 11, 2025 · Updated 5 months ago
- Code for using the Grasp Affordance Reasoning dataset ☆10 · Sep 17, 2019 · Updated 6 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025 ☆34 · Jul 8, 2025 · Updated 9 months ago
- [RA-L 2023] Active Implicit Object Reconstruction using Uncertainty-guided Next-Best-View Optimization ☆13 · Aug 28, 2023 · Updated 2 years ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆112 · Jan 27, 2026 · Updated 2 months ago
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects ☆64 · Sep 30, 2025 · Updated 6 months ago
- Official code for EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models ☆117 · Jun 13, 2025 · Updated 10 months ago
- ☆41 · Aug 27, 2024 · Updated last year
- ☆27 · Jun 2, 2025 · Updated 10 months ago