HaoyiZhu / RealRobot
Open-source implementations on real robots
☆34 · Updated last year
Alternatives and similar repositories for RealRobot
Users who are interested in RealRobot are comparing it to the repositories listed below.
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆90 · Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆172 · Updated 7 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆45 · Updated 7 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024) ☆28 · Updated 4 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation ☆93 · Updated 8 months ago
- [ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation" ☆72 · Updated last month
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆33 · Updated 2 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding. ☆53 · Updated 2 months ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction ☆83 · Updated 6 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation ☆45 · Updated 7 months ago
- LaSSM: Efficient Semantic-Spatial Query Decoding via Local Aggregation and State Space Models for 3D Instance Segmentation ☆16 · Updated 8 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… ☆29 · Updated 2 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding ☆32 · Updated 11 months ago
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde… ☆54 · Updated 4 months ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting ☆41 · Updated last year
- Click to Grasp takes calibrated RGB-D images of a tabletop and user-defined part instances in diverse source images as input, and produce… ☆21 · Updated last year
- A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds ☆29 · Updated last year
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 · Updated 3 months ago
- ☆103 · Updated 3 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆129 · Updated 8 months ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA. ☆55 · Updated 3 weeks ago
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025] ☆48 · Updated 3 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆121 · Updated 6 months ago
- Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?** ☆131 · Updated last week
- Code for CoRL 2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation" ☆44 · Updated 2 months ago
- ☆47 · Updated 7 months ago
- GraspSplats: Efficient Manipulation with 3D Feature Splatting ☆146 · Updated last year
- Official implementation of the paper "GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation" ☆161 · Updated 3 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation ☆162 · Updated 8 months ago
- SceneFun3D ToolKit ☆166 · Updated 9 months ago