Eaphan / STEMDLinks
Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)
☆13Updated 7 months ago
Alternatives and similar repositories for STEMD
Users that are interested in STEMD are comparing it to the libraries listed below
Sorting:
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆39Updated last year
- Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models (NeurIPS2024)☆40Updated last year
- [ICCV2025] CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception☆42Updated 2 months ago
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'☆55Updated 11 months ago
- [CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …☆15Updated last year
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)☆28Updated last year
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆48Updated 9 months ago
- Vispy-based NuScenes Visualization Toolkit☆15Updated 2 years ago
- This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diff…☆33Updated 2 months ago
- [ICCV 2025] Language Driven Occupancy Prediction☆31Updated 11 months ago
- (CVPR 2024) Paper: Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations☆26Updated last year
- Official code for SOAP implementation☆11Updated 3 months ago
- [CVPR2024] PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection☆76Updated last year
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆80Updated 6 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆49Updated 2 years ago
- ☆43Updated 6 months ago
- ☆38Updated 9 months ago
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)☆33Updated last month
- [ICCV 2025 Highlight] Mamba-Fusion, Multi-modal 3D Object Detection☆58Updated 3 months ago
- [INFFUS 2025] CoreNet: Conflict Resolution Network for point-pixel misalignment and sub-task suppression of 3D LiDAR-camera object detect…☆21Updated 8 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆74Updated last year
- MSF: Motion-guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud Sequences (CVPR 2023)☆63Updated 2 years ago
- This is the official implementation of ECCV2024 paper "Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Percepti…☆18Updated last year
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆47Updated 2 months ago
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆89Updated 10 months ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆35Updated 2 years ago
- [CVPR2025] The code for "Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction."☆17Updated last month
- [ECCV2022] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection☆25Updated 3 years ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆50Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆81Updated 2 years ago