zhanghm1995 / D2-WorldView external linksLinks
Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge.
☆16Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for D2-World
Users that are interested in D2-World are comparing it to the libraries listed below
Sorting:
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps☆20Feb 5, 2024Updated 2 years ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆61Jan 10, 2025Updated last year
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆37Aug 24, 2025Updated 5 months ago
- ☆22Mar 18, 2025Updated 10 months ago
- ☆50Oct 26, 2025Updated 3 months ago
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆22Nov 28, 2025Updated 2 months ago
- [ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction☆38Dec 1, 2025Updated 2 months ago
- [ECCV 2024] Occupancy as Set of Points☆92Jul 8, 2024Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆72Aug 5, 2024Updated last year
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆32Sep 28, 2024Updated last year
- ☆139Dec 4, 2025Updated 2 months ago
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆75Sep 26, 2024Updated last year
- HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection(CVPR 2024)☆38Sep 6, 2025Updated 5 months ago
- ☆65Jul 13, 2025Updated 7 months ago
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆33Aug 14, 2025Updated 6 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆39Mar 2, 2025Updated 11 months ago
- M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models☆46Jul 17, 2025Updated 6 months ago
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models☆21Nov 26, 2025Updated 2 months ago
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆37Dec 8, 2023Updated 2 years ago
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆13Jun 16, 2025Updated 7 months ago
- ☆10Apr 7, 2025Updated 10 months ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving"☆30Jan 25, 2026Updated 2 weeks ago
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention'☆11Mar 27, 2024Updated last year
- ☆11Jun 19, 2024Updated last year
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆15Nov 14, 2025Updated 3 months ago
- This is the publish code of TrackAny3D (ICCV2025).☆15Oct 20, 2025Updated 3 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- ☆11Jun 17, 2024Updated last year
- Computes the Henry coefficient of methane in IRMOF-1☆10Oct 5, 2021Updated 4 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- DiFSD: Ego-Centric Fully Sparse Paradigm for End-to-End Self-Driving☆14Mar 9, 2025Updated 11 months ago
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)☆13Apr 3, 2025Updated 10 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆207Jan 5, 2026Updated last month
- [ECCV 2024, TPAMI 2025]Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene C…☆49Dec 31, 2025Updated last month
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers☆60Nov 1, 2024Updated last year
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆230Jul 14, 2025Updated 7 months ago
- [RA-L 25] Self-Supervised Diffusion-Based Scene Flow Estimation and Motion Segmentation with 4D Radar☆23May 16, 2025Updated 8 months ago
- Towards Visual Explanations for Convolutional Neural Networks via Input Resampling☆13Aug 16, 2017Updated 8 years ago