LMD0311 / HERMESLinks
[ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
☆155Updated last month
Alternatives and similar repositories for HERMES
Users that are interested in HERMES are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”☆284Updated last year
- [ICCV 2025] Official implementation of the paper “MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adapti…☆639Updated 2 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆410Updated last month
- Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"☆176Updated 3 weeks ago
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis☆173Updated 10 months ago
- [ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”☆1,118Updated 4 months ago
- Awesome Data-Driven Autonomous Driving Solutions. Also the official repository of our survey paper: Data-Centric Evolution in Autonomous …☆178Updated last year
- [CVPR 2024] Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching☆62Updated 2 months ago
- [CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.☆421Updated last year
- Official Implementation for TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer☆74Updated last month
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆46Updated 11 months ago
- [CVPR 2025] ReconDreamer☆172Updated 8 months ago
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers☆53Updated 9 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 8 months ago
- [ECCV 2024] GVGEN: Text-to-3D Generation with Volumetric Representation☆124Updated 9 months ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆152Updated last month
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆57Updated 3 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆162Updated last month
- ☆27Updated 11 months ago
- ☆39Updated last month
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'☆53Updated 8 months ago
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar…☆58Updated 5 months ago
- official implement for 《LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data 》☆18Updated 8 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆239Updated last year
- the official code of DriveMonkey☆31Updated 3 months ago
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆38Updated 9 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆56Updated 7 months ago
- Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge.☆13Updated 4 months ago
- [ECCV 2024] Occupancy as Set of Points☆90Updated last year
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆41Updated 6 months ago