shalfun / DriVerse
Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment"
☆199 Updated 3 weeks ago
Alternatives and similar repositories for DriVerse
Users who are interested in DriVerse are comparing it to the repositories listed below.
- A Unified Driving World Model for Future Generation and Perception ☆103 Updated 2 months ago
- [ICRA 2025] AVD2: Accident Video Diffusion for Accident Video Description ☆82 Updated 2 weeks ago
- ☆142 Updated 3 months ago
- [CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation ☆389 Updated last month
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction ☆108 Updated last month
- Implementation for "Challenger: Affordable Adversarial Driving Video Generation" ☆95 Updated last week
- ☆83 Updated this week
- ☆126 Updated 2 months ago
- [CVPR2025] STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction ☆48 Updated last month
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation ☆115 Updated last year
- ☆52 Updated last month
- [CVPR2025] Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving ☆183 Updated last month
- A Strong Tracking Framework for 3D SOT on LiDAR Point Clouds ☆73 Updated 2 weeks ago
- [ICCV23] DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection ☆84 Updated last year
- ☆79 Updated this week
- Wan2.1 with Controlnet ☆166 Updated 2 months ago
- This is the repository that contains source code for the PhysGen3D. ☆197 Updated last month
- [ICRA 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. (Early version: UniOcc) ☆395 Updated 5 months ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting. ☆95 Updated 2 months ago
- ✨✨ Latest advancements in VLA (Vision Language Action) models ☆73 Updated last month
- ☆80 Updated 7 months ago
- A lightweight LMM-based Document Parsing Model ☆118 Updated this week
- [ICRA 2023] LODE: Locally Conditioned Eikonal Implicit Scene Completion from Sparse LiDAR ☆159 Updated 2 years ago
- Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as TPVFormer, SurroundOcc, PanoOcc, OccFormer, FB-OCC, SelfOcc, … ☆138 Updated 2 weeks ago
- This repo is the official implementation of "BEVTrack: A Simple and Strong Baseline for 3D Single Object Tracking in Bird's-Eye View". ☆75 Updated 3 weeks ago
- [T-PAMI 2025] SceneTracker: Long-term Scene Flow Estimation Network ☆117 Updated last week
- ☆89 Updated last year
- 📌 [Arxiv2025] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆171 Updated 2 months ago
- [NeurIPS 2022] TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation ☆114 Updated 2 years ago
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild ☆43 Updated 4 months ago