A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond
☆100Mar 8, 2026Updated this week
Alternatives and similar repositories for awesome-vla-wam
Users that are interested in awesome-vla-wam are comparing it to the libraries listed below
Sorting:
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs.☆12Apr 24, 2025Updated 10 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.☆53Jan 12, 2026Updated last month
- A self-made NeurIPS poster template, infused with the unique design style of ShanghaiTech.☆15Dec 26, 2023Updated 2 years ago
- HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆47Dec 11, 2025Updated 2 months ago
- LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion☆46Updated this week
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 3 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆83Jan 16, 2026Updated last month
- The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)☆163Jan 30, 2026Updated last month
- 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆89Feb 11, 2026Updated 3 weeks ago
- PyTorch implementation of the descriptor DEAL presented at NeurIPS 2021 "Extracting Deformation-Aware Local Features by Learning to Defor…☆31Jan 12, 2022Updated 4 years ago
- EO: Open-source Unified Embodied Foundation Model Series☆51Jan 15, 2026Updated last month
- [ECCV24] official code for "OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations"☆79Apr 28, 2025Updated 10 months ago
- NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards☆93Jan 11, 2026Updated last month
- 3D Gaussian Splatting for underwater scene reconstruction via physcial-based appearance-medium decoupling☆23Feb 13, 2026Updated 3 weeks ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- ☆36Feb 6, 2026Updated last month
- ☆30Dec 16, 2025Updated 2 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆70Jan 12, 2026Updated last month
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆87Feb 25, 2026Updated last week
- Official implementation of VLANeXt.☆112Updated this week
- ☆39Dec 8, 2023Updated 2 years ago
- 3D Odometry Visualization and Processing Tool☆26Dec 17, 2021Updated 4 years ago
- Code for "Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via Optimization Trajectory Distillation", ICCV 2023☆16Aug 31, 2023Updated 2 years ago
- Arduino library for Gavesha® Robomatics Gear Motor.☆10Feb 15, 2025Updated last year
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- ☆29Aug 6, 2025Updated 7 months ago
- ☆14Aug 10, 2025Updated 6 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- [CVPR 2026] Visual Geometry Transformer for Autonomous Driving☆191Dec 19, 2025Updated 2 months ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆131Nov 14, 2025Updated 3 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- AI foundation and trend seminar tutorial with code☆24Oct 25, 2025Updated 4 months ago
- ☆13Oct 21, 2024Updated last year
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions☆162Jan 2, 2026Updated 2 months ago
- MCP server for Grok AI API integration☆22Jun 2, 2025Updated 9 months ago
- The project is intended to demonstrate Lane tracking & detection on Qualcomm’s Robotics Platform RB5. YOLOP is the architecture used to i…☆10Aug 22, 2023Updated 2 years ago
- ☆17Jun 18, 2025Updated 8 months ago