worldbench / WorldLens
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
☆178 Updated 2 weeks ago
Alternatives and similar repositories for WorldLens
Users who are interested in WorldLens are comparing it to the repositories listed below.
- Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems ☆133 Updated 3 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views ☆181 Updated last month
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perception ☆136 Updated 6 months ago
- ☆93 Updated 6 months ago
- The first open-sourced diffusion vision-language-action model. ☆160 Updated 3 weeks ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving ☆166 Updated this week
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving ☆272 Updated 3 months ago
- Official code of Motus: A Unified Latent Action World Model ☆616 Updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D ☆200 Updated last month
- Wan2.1 with ControlNet ☆181 Updated 10 months ago
- ☆321 Updated 3 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆215 Updated 3 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm… ☆378 Updated last month
- Data and sample evaluation codes for Multimodal Rewardbench 2 ☆133 Updated last month
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI ☆1,439 Updated 2 months ago
- [CoRL 2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction ☆130 Updated 3 months ago
- First Video Deep Research Benchmark ☆139 Updated 2 weeks ago
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide ☆337 Updated last month
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆82 Updated this week
- [EMNLP 2025] Official implementation: agent-style visual question answering in autonomous driving ☆136 Updated 4 months ago
- [ACMMM 2025] Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti… ☆217 Updated 8 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆199 Updated 3 years ago
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction. ☆210 Updated 4 months ago
- [NeurIPS 2024] Official Implementation of Hawk: Learning to Understand Open-World Video Anomalies ☆224 Updated 9 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better ☆186 Updated this week
- This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling". ☆123 Updated 3 weeks ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model ☆2,236 Updated this week
- The accepted paper for CVPR 2025. ☆55 Updated last month
- [NeurIPS 2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding" ☆108 Updated 2 months ago
- [ICCV 2025] Official code for Improving Generalist Model with Domain-Specific Experts ☆87 Updated 3 months ago