GigaAI-research / SwiftVLALinks
☆50Updated last month
Alternatives and similar repositories for SwiftVLA
Users that are interested in SwiftVLA are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆67Updated 11 months ago
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆73Updated 3 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 6 months ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception☆124Updated 9 months ago
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction"☆53Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆125Updated 7 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆58Updated 5 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆201Updated 8 months ago
- Official implementation for "SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation"☆47Updated last month
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆265Updated 3 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆25Updated 2 months ago
- Nav-R1: Reasoning and Navigation in Embodied Scenes☆96Updated 2 months ago
- Official Github Repo for GEM☆99Updated 2 months ago
- [ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler☆25Updated 5 months ago
- ☆54Updated last year
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆71Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆96Updated last year
- ☆25Updated 11 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆69Updated 2 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆128Updated 10 months ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆30Updated last year
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆67Updated 3 weeks ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆57Updated 11 months ago
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆67Updated 2 weeks ago
- ☆61Updated 7 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆51Updated last month
- ☆222Updated 5 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆153Updated last month
- [NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving☆136Updated this week
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆173Updated 6 months ago