[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆43Dec 2, 2025Updated 5 months ago
Alternatives and similar repositories for EgoScaler
Users that are interested in EgoScaler are comparing it to the libraries listed below.
- [ICCV 2025] DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation☆85May 12, 2026Updated 2 weeks ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆66Aug 4, 2025Updated 9 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆47Nov 21, 2025Updated 6 months ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆19Apr 11, 2025Updated last year
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆37Apr 17, 2026Updated last month
- rmp data ranking☆13Nov 4, 2025Updated 6 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆43Jan 29, 2026Updated 3 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆183Jun 20, 2025Updated 11 months ago
- [CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model☆32Jun 26, 2025Updated 10 months ago
- VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders☆88Mar 25, 2026Updated 2 months ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs.☆12Apr 24, 2025Updated last year
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 9 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆49Apr 10, 2026Updated last month
- ☆143Aug 27, 2025Updated 8 months ago
- ☆11Jul 19, 2023Updated 2 years ago
- An open source Multi-View Latent Diffusion Model☆44Feb 23, 2026Updated 3 months ago
- a Video Quality Analysis Toolkit☆14May 16, 2025Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆47Mar 18, 2026Updated 2 months ago
- ☆19Nov 4, 2024Updated last year
- Code for Stable Control Representations☆26Apr 5, 2025Updated last year
- ☆132Dec 4, 2025Updated 5 months ago
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆43Jun 26, 2025Updated 11 months ago
- Documentation and software tools for the Novel Sensors for Autonomous Vehicle Perception (NSAVP) dataset☆24Mar 20, 2026Updated 2 months ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction☆87Oct 7, 2025Updated 7 months ago
- Official repository of the paper: Masked Scene Modeling (CVPR 2025)☆17Dec 13, 2025Updated 5 months ago
- [IROS 2025] Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction☆25Dec 2, 2025Updated 5 months ago
- ☆18May 7, 2025Updated last year
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆35May 13, 2026Updated last week
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆32Dec 9, 2025Updated 5 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated 2 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆54Apr 18, 2026Updated last month
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆187Mar 10, 2026Updated 2 months ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆36Jul 8, 2025Updated 10 months ago
- [RA-L 2023] Active Implicit Object Reconstruction using Uncertainty-guided Next-Best-View Optimization☆13Aug 28, 2023Updated 2 years ago
- [CoRL 2025] UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆86Dec 18, 2025Updated 5 months ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆113Jan 27, 2026Updated 3 months ago
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects☆75Sep 30, 2025Updated 7 months ago