Eaphan / STEMDLinks
Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)
☆13Updated 6 months ago
Alternatives and similar repositories for STEMD
Users that are interested in STEMD are comparing it to the libraries listed below
Sorting:
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆39Updated last year
- Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models (NeurIPS2024)☆41Updated 10 months ago
- [ICCV 2025] Language Driven Occupancy Prediction☆26Updated 9 months ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆46Updated 2 weeks ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆46Updated 7 months ago
- Official code for SOAP implementation☆10Updated 2 months ago
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'☆53Updated 10 months ago
- Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving☆15Updated 8 months ago
- ☆36Updated 7 months ago
- [ICCV2025] CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception☆29Updated last month
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)☆28Updated last year
- Vispy-based NuScenes Visualization Toolkit☆15Updated 2 years ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆48Updated 11 months ago
- [CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …☆15Updated last year
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆34Updated 2 years ago
- ☆11Updated 8 months ago
- [ECCV 2024] Towards Stable 3D Object Detection☆47Updated last year
- [ECCV2024] Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance☆22Updated last year
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆70Updated last year
- This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diff…☆29Updated last month
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆38Updated 11 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆80Updated 4 months ago
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)☆32Updated last week
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆80Updated 2 years ago
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆58Updated 8 months ago
- [CVPR2024] PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection☆73Updated last year
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆85Updated 9 months ago
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving☆20Updated last year
- [ICCV 2025 Highlight] Mamba-Fusion, Multi-modal 3D Object Detection☆46Updated 2 months ago
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆32Updated last month