[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆46Dec 2, 2025Updated 6 months ago
Alternatives and similar repositories for EgoScaler
Users who are interested in EgoScaler are comparing it to the libraries listed below.
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆66Aug 4, 2025Updated 10 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆48Nov 21, 2025Updated 6 months ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆19Apr 11, 2025Updated last year
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆37Apr 17, 2026Updated last month
- rmp data ranking☆13Nov 4, 2025Updated 7 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆43Jan 29, 2026Updated 4 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆183Jun 20, 2025Updated 11 months ago
- [CVPR 2025] Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model☆33Jun 26, 2025Updated 11 months ago
- VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders☆91Mar 25, 2026Updated 2 months ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs.☆12Apr 24, 2025Updated last year
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 10 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆50Apr 10, 2026Updated 2 months ago
- ☆143Aug 27, 2025Updated 9 months ago
- ☆11Jul 19, 2023Updated 2 years ago
- An open source Multi-View Latent Diffusion Model☆44Feb 23, 2026Updated 3 months ago
- a Video Quality Analysis Toolkit☆14May 16, 2025Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jun 2, 2026Updated last week
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆48Mar 18, 2026Updated 2 months ago
- ☆19Nov 4, 2024Updated last year
- Code for Stable Control Representations☆27Apr 5, 2025Updated last year
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆44Jun 26, 2025Updated 11 months ago
- Documentation and software tools for the Novel Sensors for Autonomous Vehicle Perception (NSAVP) dataset☆24Mar 20, 2026Updated 2 months ago
- [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction☆91Oct 7, 2025Updated 8 months ago
- Official repository of the paper: Masked Scene Modeling (CVPR 2025)☆17Dec 13, 2025Updated 6 months ago
- ☆147Dec 4, 2025Updated 6 months ago
- [IROS 2025] Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction☆25Dec 2, 2025Updated 6 months ago
- ☆18May 7, 2025Updated last year
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆36May 25, 2026Updated 3 weeks ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆33Dec 9, 2025Updated 6 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated 3 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆57Apr 18, 2026Updated last month
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆190Mar 10, 2026Updated 3 months ago
- Code for using the Grasp Affordance Reasoning dataset☆10Sep 17, 2019Updated 6 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆36Jul 8, 2025Updated 11 months ago
- [RA-L 2023] Active Implicit Object Reconstruction using Uncertainty-guided Next-Best-View Optimization☆13Aug 28, 2023Updated 2 years ago
- [CoRL 2025] UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆86Dec 18, 2025Updated 5 months ago
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects☆79Jun 1, 2026Updated 2 weeks ago
- ☆41Aug 27, 2024Updated last year