Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge.
☆16Apr 10, 2025Updated last year
Alternatives and similar repositories for D2-World
Users that are interested in D2-World are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collect papers and codes about VQGAN in various Computer Vision tasks☆10Dec 20, 2022Updated 3 years ago
- ☆21Jul 1, 2024Updated last year
- [CVPR 2025] Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting☆25Mar 3, 2025Updated last year
- [AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries☆48Jan 14, 2026Updated 3 months ago
- ☆23Mar 18, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆42Aug 24, 2025Updated 8 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆64Apr 12, 2026Updated 3 weeks ago
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆25Nov 28, 2025Updated 5 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆40Mar 2, 2025Updated last year
- ☆57Oct 26, 2025Updated 6 months ago
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆23Apr 22, 2026Updated 2 weeks ago
- [ECCV 2024] Occupancy as Set of Points☆93Jul 8, 2024Updated last year
- Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps☆22Feb 5, 2024Updated 2 years ago
- [CVPR 2025] Gaussian World Model for Streaming 3D Occupancy Prediction☆152Dec 4, 2025Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆76Sep 26, 2024Updated last year
- [ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction☆45Dec 1, 2025Updated 5 months ago
- ☆113Oct 21, 2025Updated 6 months ago
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆15Jun 16, 2025Updated 10 months ago
- ☆11Jun 19, 2024Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆71Aug 5, 2024Updated last year
- Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)☆13Apr 3, 2025Updated last year
- [NeurIPS 2025] Scene as Superquadrics for 3D Semantic Occupancy Prediction☆66Jul 13, 2025Updated 9 months ago
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆33Sep 28, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention'☆11Mar 27, 2024Updated 2 years ago
- ☆19Dec 8, 2024Updated last year
- Official implementation of Dense Prediction with Attentive Feature Aggregation, WACV 2023☆12Jan 31, 2023Updated 3 years ago
- [ICCV 2025] RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes☆26Feb 10, 2026Updated 2 months ago
- This is the publish code of TrackAny3D (ICCV2025).☆18Oct 20, 2025Updated 6 months ago
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 8 months ago
- A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities…☆272Jul 1, 2024Updated last year
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models☆21Nov 26, 2025Updated 5 months ago
- ☆17Oct 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Mar 5, 2025Updated last year
- [CVPR'24] "Unsupervised Occupancy Learning from Sparse Point Cloud"☆16Sep 25, 2024Updated last year
- ECCV[2024] "Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model" official implement☆17Jul 15, 2025Updated 9 months ago
- ☆17Jul 18, 2024Updated last year
- [ICRA'2024] MonoOcc: Digging into Monocular Semantic Occupancy Prediction☆114Oct 23, 2023Updated 2 years ago
- The official implementation of the paper DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection (ICLR 2023)☆18Sep 17, 2023Updated 2 years ago
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆18Nov 14, 2025Updated 5 months ago