zhanghm1995 / D2-World
Official PyTorch implementation of D^2-World, which won second place and the innovation award in the CVPR 2024 Predictive World Model Challenge.
☆16 · Apr 10, 2025 · Updated 10 months ago
Alternatives and similar repositories for D2-World
Users who are interested in D2-World are comparing it to the libraries listed below.
- [AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries ☆36 · Jan 14, 2026 · Updated last month
- [CVPR 2025] Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting ☆25 · Mar 3, 2025 · Updated 11 months ago
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps ☆20 · Feb 5, 2024 · Updated 2 years ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model* ☆61 · Jan 10, 2025 · Updated last year
- Adding Scene-Centric Forecasting Control to Occupancy World Model ☆37 · Aug 24, 2025 · Updated 5 months ago
- ☆22 · Mar 18, 2025 · Updated 10 months ago
- ☆50 · Oct 26, 2025 · Updated 3 months ago
- [ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction ☆38 · Dec 1, 2025 · Updated 2 months ago
- [ECCV 2024] Occupancy as Set of Points ☆92 · Jul 8, 2024 · Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202… ☆72 · Aug 5, 2024 · Updated last year
- ☆139 · Dec 4, 2025 · Updated 2 months ago
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection ☆32 · Sep 28, 2024 · Updated last year
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio… ☆75 · Sep 26, 2024 · Updated last year
- ☆65 · Jul 13, 2025 · Updated 7 months ago
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024) ☆33 · Aug 14, 2025 · Updated 6 months ago
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models ☆21 · Nov 26, 2025 · Updated 2 months ago
- ☆37 · Feb 16, 2025 · Updated 11 months ago
- M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models ☆46 · Jul 17, 2025 · Updated 6 months ago
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud ☆37 · Dec 8, 2023 · Updated 2 years ago
- ☆10 · Apr 7, 2025 · Updated 10 months ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving" ☆30 · Jan 25, 2026 · Updated 2 weeks ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection ☆41 · Nov 1, 2024 · Updated last year
- This is the publish code of TrackAny3D (ICCV2025). ☆15 · Oct 20, 2025 · Updated 3 months ago
- Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024) ☆13 · Apr 3, 2025 · Updated 10 months ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception ☆14 · Jul 4, 2025 · Updated 7 months ago
- ☆11 · Jun 17, 2024 · Updated last year
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention' ☆11 · Mar 27, 2024 · Updated last year
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection ☆15 · Nov 14, 2025 · Updated 3 months ago
- ☆11 · Jun 19, 2024 · Updated last year
- Computes the Henry coefficient of methane in IRMOF-1 ☆10 · Oct 5, 2021 · Updated 4 years ago
- DiFSD: Ego-Centric Fully Sparse Paradigm for End-to-End Self-Driving ☆14 · Mar 9, 2025 · Updated 11 months ago
- [ECCV 2024, TPAMI 2025] Official PyTorch Implementation of HTCL: Hierarchical Temporal Context Learning for Camera-based Semantic Scene C… ☆49 · Dec 31, 2025 · Updated last month
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers ☆60 · Nov 1, 2024 · Updated last year
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation ☆230 · Jul 14, 2025 · Updated 7 months ago
- A python toolkit to create photorealistic image datasets for machine learning with Blender. ☆12 · Jun 14, 2021 · Updated 4 years ago
- ☆12 · Jul 18, 2024 · Updated last year
- bayes-people-tracker from strands-perception-people ☆11 · Nov 18, 2025 · Updated 2 months ago
- [ICRA'2024] MonoOcc: Digging into Monocular Semantic Occupancy Prediction ☆114 · Oct 23, 2023 · Updated 2 years ago