NVlabs / FocalFormer3D
Official PyTorch implementation of FocalFormer3D [ICCV 2023]
☆180Updated 2 weeks ago
Alternatives and similar repositories for FocalFormer3D:
Users that are interested in FocalFormer3D are comparing it to the libraries listed below
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆205Updated 5 months ago
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆166Updated last year
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆133Updated 3 months ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆244Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection☆93Updated 3 months ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆160Updated last year
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆215Updated 11 months ago
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆229Updated 2 years ago
- [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation☆158Updated 5 months ago
- [NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction☆227Updated 4 months ago
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆104Updated last year
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆113Updated last year
- This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion fo…☆123Updated 5 months ago
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆180Updated 6 months ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation☆136Updated 7 months ago
- An Efficient, Flexible, and General deep learning framework that retains minimal.☆111Updated last year
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆253Updated last year
- [CVPR2021] PointAugmenting: Cross-Modal Augmentation for 3D Object Detection☆112Updated 2 years ago
- Open Source 3D Occupancy Prediction Library.☆139Updated last year
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆299Updated 4 months ago
- Vision-based 3D occupancy prediction in autonomous driving: a review and outlook☆186Updated 6 months ago
- "Rethinking IoU-based Optimization for Single-stage 3D Object Detection", ECCV2022 accept!☆129Updated last year
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆117Updated last year
- (NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation☆116Updated last year
- Implementation of PF-Track☆216Updated last year
- Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)☆108Updated 9 months ago
- Efficient Point-based 3D Semantic Occupancy Prediction☆136Updated 6 months ago
- [NeurIPS 2024] Official code of ”LION: Linear Group RNN for 3D Object Detection in Point Clouds“☆152Updated 3 months ago
- ☆55Updated last year
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection☆147Updated 2 years ago