worldbench / WorldLensLinks
π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
β166Updated 2 weeks ago
Alternatives and similar repositories for WorldLens
Users that are interested in WorldLens are comparing it to the libraries listed below
Sorting:
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ120Updated this week
- A Unified Driving World Model for Future Generation and Perceptionβ132Updated 5 months ago
- β94Updated 6 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Drivingβ265Updated 2 months ago
- β303Updated 2 months ago
- Data and sample evaluation codes for Multimodal Rewardbench 2β122Updated 2 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ174Updated 3 weeks ago
- Official code of Motus: A Unified Latent Action World Modelβ541Updated this week
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ197Updated last week
- Wan2.1 with Controlnetβ179Updated 9 months ago
- π₯ The first open-sourced diffusion vision-langauge-action model.β149Updated last week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ1,243Updated last month
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ128Updated 3 months ago
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guideβ321Updated last week
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction.β210Updated 3 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ1,021Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ215Updated 2 months ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"β71Updated 2 weeks ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modelingβ82Updated 10 months ago
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Expertsβ86Updated 2 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixelsβ165Updated 2 weeks ago
- π₯ OneThinker: All-in-one Reasoning Model for Image and Videoβ355Updated 3 weeks ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Betterβ185Updated 6 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β377Updated 3 weeks ago
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequencesβ181Updated 3 weeks ago
- [ICCV 2025] Perspective-Invariant 3D Object Detectionβ153Updated 2 weeks ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β200Updated 3 years ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mulβ¦β100Updated 5 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!β134Updated 3 months ago
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β175Updated 4 months ago