JihyeokKim / MonoDINO-DETR
MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model
☆17Updated last month
Alternatives and similar repositories for MonoDINO-DETR:
Users that are interested in MonoDINO-DETR are comparing it to the libraries listed below
- [CVPR2025] Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving☆60Updated last month
- ECCV 2024 Paper List about Autonomous Driving☆129Updated 6 months ago
- ☆74Updated last month
- Official implementation of PointBeV: A Sparse Approach to BeV Predictions☆117Updated last year
- ☆47Updated 7 months ago
- OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model☆118Updated 2 weeks ago
- (ICLR2025) Enhancing End-to-End Autonomous Driving with Latent World Model☆148Updated last month
- VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning☆37Updated last month
- ☆24Updated 3 months ago
- [ECCV 2024] The official implementation of DualBEV☆56Updated 9 months ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆109Updated 5 months ago
- [ECCV'24] LISO: Lidar-only Self-Supervised 3D Object Detection☆43Updated 2 months ago
- End-to-End Driving with Online Trajectory Evaluation via BEV World Model☆42Updated last week
- [ECCV 2024] Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention☆99Updated last month
- ☆87Updated 6 months ago
- ☆33Updated 6 months ago
- [CVPR2024] Official implementation of "RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception"☆94Updated 10 months ago
- [IROS2024] Camera-Radar Fusion for BEV Map and Object Segmentation☆78Updated last month
- Repo of "GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving"☆132Updated last week
- [IV'24] UniBEV: the official implementation of UniBEV☆36Updated 9 months ago
- ☆108Updated 9 months ago
- Official Code Release of "FusionAD"☆140Updated 9 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆133Updated last year
- [ECCV 2024] A Simple and Effective 3D DETR in Point Clouds☆72Updated 5 months ago
- ☆219Updated 9 months ago
- [IV2024] MultiCorrupt: A benchmark for robust multi-modal 3D object detection, evaluating LiDAR-Camera fusion models in autonomous drivin…☆61Updated this week
- Open source training framework for vision tasks. Scales up on data and scales up on tasks. Official Implementation for https://arxiv.org/…☆38Updated last year
- POWERBEV, a novel and elegant vision-based end-to-end framework that only consists of 2D convolutional layers to perform perception and f…☆88Updated last year
- [ICLR 2025] The official implementation of SSR☆146Updated 3 weeks ago
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆69Updated 3 months ago