video-fm / LASER
This is a public version of LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
☆164Updated 2 months ago
Alternatives and similar repositories for LASER
Users who are interested in LASER are comparing it to the repositories listed below
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆1,439Updated 2 months ago
- Official code of Motus: A Unified Latent Action World Model☆616Updated last month
- 🔥 The first open-sourced diffusion vision-language-action model.☆159Updated 3 weeks ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,064Updated 2 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆480Updated 2 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆268Updated 3 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆489Updated 3 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated last month
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆757Updated 5 months ago
- The accepted paper for CVPR 2025.☆55Updated last month
- ☆128Updated 2 months ago
- ☆545Updated 3 months ago
- ☆30Updated last year
- ScaleCUA provides open-source computer use agents that operate across platforms (Windows, macOS, Ubuntu, Android).☆1,065Updated 3 weeks ago
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges☆295Updated last week
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆177Updated 2 weeks ago
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆75Updated last year
- Explain Before You Answer: A Survey on Compositional Visual Reasoning☆306Updated 3 months ago
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,953Updated 2 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago
- (ICCV-2025 Official Code) Improving Generalist Model with Domain-Specific Experts☆87Updated 3 months ago
- ☆19Updated last year
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆129Updated 2 months ago
- CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation☆17Updated 8 months ago
- Official repository of DARE: dLLM Alignment and Reinforcement Executor☆159Updated this week
- ☆246Updated last year
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 8 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 6 months ago
- Data and sample evaluation codes for Multimodal Rewardbench 2☆133Updated last month