Haiyang-W / UniTRLinks
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
☆338Updated last year
Alternatives and similar repositories for UniTR
Users that are interested in UniTR are comparing it to the libraries listed below
Sorting:
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆311Updated last year
- [ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection☆385Updated 2 years ago
- [NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction☆245Updated 5 months ago
- [ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos☆428Updated last year
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆241Updated last year
- ☆302Updated last year
- 3D occupancy☆392Updated 2 years ago
- [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"☆424Updated last year
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆183Updated last year
- The official repository for BEVerse☆422Updated 3 years ago
- Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection☆326Updated 2 years ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆263Updated 2 years ago
- [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric☆365Updated last year
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer☆243Updated 2 years ago
- Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy☆251Updated 2 years ago
- Sparse4D v1 & v2☆471Updated last year
- [ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction☆382Updated 2 years ago
- An official code release of our CVPR'23 paper, BEVHeight☆222Updated last year
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆174Updated 2 years ago
- Vision-based 3D occupancy prediction in autonomous driving: a review and outlook☆244Updated last year
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning☆199Updated last month
- ☆237Updated last year
- This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion fo…☆175Updated last year
- Implementation of PF-Track☆242Updated 2 years ago
- Implemented BEVFormer support for BEV segmentation☆142Updated 2 years ago
- 3D Occupancy Prediction Benchmark in Autonomous Driving☆382Updated 3 months ago
- [CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection☆200Updated 2 years ago
- Official PyTorch implementation of FocalFormer3D [ICCV 2023]☆198Updated 9 months ago
- PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds (CVPR 2023)☆230Updated last year
- [ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer☆217Updated 10 months ago