Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge.
☆16Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for D2-World
Users that are interested in D2-World are comparing it to the libraries listed below
Sorting:
- Collect papers and codes about VQGAN in various Computer Vision tasks☆10Dec 20, 2022Updated 3 years ago
- [AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries☆40Jan 14, 2026Updated last month
- [CVPR 2025] Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting☆25Mar 3, 2025Updated last year
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps☆21Feb 5, 2024Updated 2 years ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆61Jan 10, 2025Updated last year
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆38Aug 24, 2025Updated 6 months ago
- ☆52Oct 26, 2025Updated 4 months ago
- ☆22Mar 18, 2025Updated 11 months ago
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆23Nov 28, 2025Updated 3 months ago
- ☆21Jul 1, 2024Updated last year
- [ECCV 2024] Occupancy as Set of Points☆91Jul 8, 2024Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆72Aug 5, 2024Updated last year
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆32Sep 28, 2024Updated last year
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆76Sep 26, 2024Updated last year
- HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection(CVPR 2024)☆38Sep 6, 2025Updated 6 months ago
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 6 months ago
- ☆66Jul 13, 2025Updated 7 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆39Mar 2, 2025Updated last year
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models☆21Nov 26, 2025Updated 3 months ago
- ☆37Feb 16, 2025Updated last year
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆13Jun 16, 2025Updated 8 months ago
- ☆10Apr 7, 2025Updated 11 months ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving"☆31Jan 25, 2026Updated last month
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆41Nov 1, 2024Updated last year
- Official Implementation of Driv3R☆106Dec 12, 2024Updated last year
- DiFSD: Ego-Centric Fully Sparse Paradigm for End-to-End Self-Driving☆14Mar 9, 2025Updated 11 months ago
- ☆11Jun 19, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- [ICCV 2025] RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes☆24Feb 10, 2026Updated 3 weeks ago
- web audio player☆21Mar 3, 2011Updated 15 years ago
- SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model☆25Dec 17, 2025Updated 2 months ago
- ☆11Jun 17, 2024Updated last year
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention'☆11Mar 27, 2024Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)☆13Apr 3, 2025Updated 11 months ago
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆206Jan 5, 2026Updated 2 months ago
- [ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene …☆51Feb 27, 2026Updated last week
- [CVPR 2023] Official implementation of the paper "Semi-DETR: Semi-Supervised Object Detection with Detection Transformers"☆104Nov 26, 2024Updated last year