worldbench / WorldLensLinks
π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
β178Updated 3 weeks ago
Alternatives and similar repositories for WorldLens
Users that are interested in WorldLens are comparing it to the libraries listed below
Sorting:
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ135Updated last week
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perceptionβ136Updated 6 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ181Updated last month
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Drivingβ166Updated last week
- β93Updated 7 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Drivingβ272Updated 3 months ago
- Wan2.1 with Controlnetβ181Updated 10 months ago
- π₯ The first open-sourced diffusion vision-langauge-action model.β160Updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3Dβ200Updated last month
- β321Updated 3 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!β136Updated 4 months ago
- Official code of Motus: A Unified Latent Action World Modelβ616Updated last month
- First Video Deep Research Benchmarkβ139Updated 2 weeks ago
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β176Updated 5 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Betterβ186Updated this week
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modelingβ82Updated this week
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Expertsβ87Updated 3 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ130Updated 4 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ215Updated 3 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AIβ1,439Updated 2 months ago
- [CoRLW 2025 (Oral), IASEAI 2026] Implementation for "Challenger: Affordable Adversarial Driving Video Generation"β139Updated last month
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easyβ819Updated last month
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β199Updated 3 years ago
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guideβ337Updated last month
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction.β210Updated 4 months ago
- Data and sample evaluation codes for Multimodal Rewardbench 2β135Updated last month
- π₯ [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptatβ¦β75Updated last year
- Official implemetation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]β15Updated 9 months ago
- π₯[NeurIPS 2024] Official Implementation of Hawk: Learning to Understand Open-World Video Anomaliesβ224Updated 9 months ago
- [ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Promptiβ¦β217Updated 9 months ago