GigaAI-research / SwiftVLA
☆24 Updated last week
Alternatives and similar repositories for SwiftVLA
Users who are interested in SwiftVLA are comparing it to the repositories listed below.
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction" ☆44 Updated 2 weeks ago
- Official implementation for "SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation" ☆36 Updated last week
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes" ☆100 Updated 5 months ago
- ✨✨【NeurIPS 2025】Official implementation of BridgeVLA ☆158 Updated 2 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge ☆245 Updated 2 months ago
- ☆46 Updated 4 months ago
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation ☆67 Updated 8 months ago
- [NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving ☆129 Updated last month
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos ☆162 Updated 2 months ago
- Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling" ☆323 Updated last month
- ☆73 Updated 4 months ago
- InternRobotics' open platform for building generalized navigation foundation models. ☆440 Updated last week
- Official repository for OmniVLA training and inference code ☆114 Updated this week
- Code for OctoNav-R1 ☆61 Updated 5 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding ☆64 Updated 11 months ago
- Code of the paper "EvolveNav: Empowering LLM-Based Vision-Language Navigation via Self-Improving Embodied Reasoning" ☆25 Updated last month
- ☆25 Updated 10 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆70 Updated last year
- ☆86 Updated 6 months ago
- Nav-R1: Reasoning and Navigation in Embodied Scenes ☆81 Updated last month
- [NeurIPS 2025] Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency ☆69 Updated 2 months ago
- [ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation ☆100 Updated last month
- ☆26 Updated 3 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation ☆124 Updated 7 months ago
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation ☆132 Updated last month
- ☆52 Updated 3 months ago
- ☆91 Updated 11 months ago
- [CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild" ☆289 Updated 2 weeks ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25). ☆45 Updated 4 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆169 Updated 5 months ago