OpenDriveLab / UniVLA
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆71Updated this week
Alternatives and similar repositories for UniVLA
Users that are interested in UniVLA are comparing it to the libraries listed below
Sorting:
- ☆52Updated 2 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆77Updated 6 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆32Updated last month
- ☆49Updated 7 months ago
- FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning☆18Updated 4 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction☆106Updated 8 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆62Updated last month
- EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments☆14Updated 3 weeks ago
- Open-source implementations on real robots☆32Updated 5 months ago
- Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty☆20Updated last year
- ☆78Updated this week
- ☆62Updated 4 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆62Updated 5 months ago
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators☆76Updated 5 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆61Updated last week
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆38Updated 9 months ago
- Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆18Updated 5 months ago
- ☆59Updated last week
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆110Updated 7 months ago
- ☆71Updated 8 months ago
- ☆33Updated 4 months ago
- List of papers on video-centric robot learning☆19Updated 5 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆62Updated 2 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆113Updated 5 months ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆32Updated 7 months ago
- Unifying 2D and 3D Vision-Language Understanding☆79Updated last month
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆76Updated 3 weeks ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆20Updated 4 months ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆19Updated last year
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆101Updated last week