ZhanYang-nwpu / Mono3DVG
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images
☆22 · Updated 5 months ago
Related projects:
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O… ☆73 · Updated last month
- [ECCV 2024] Occupancy as Set of Points ☆63 · Updated 2 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation ☆33 · Updated 2 months ago
- A Unified Framework for 3D Scene Understanding ☆80 · Updated last month
- ☆28 · Updated last month
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection" ☆61 · Updated 2 months ago
- [ECCV 2024] Towards Stable 3D Object Detection ☆37 · Updated 2 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024) ☆68 · Updated last month
- Fully Sparse Fusion for 3D Object Detection ☆83 · Updated 2 months ago
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D ☆20 · Updated 2 months ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving ☆38 · Updated 2 weeks ago
- ☆15 · Updated last month
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆93 · Updated last month
- Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection ☆61 · Updated last month
- Curricular Object Manipulation in LiDAR-based Object Detection (CVPR 2023) ☆37 · Updated last year
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023 ☆72 · Updated last year
- ☆71 · Updated last year
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering ☆17 · Updated last month
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation ☆86 · Updated last year
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling ☆46 · Updated 11 months ago
- Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica dataset… ☆62 · Updated 3 weeks ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I… ☆59 · Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆20 · Updated 2 weeks ago
- (ICCV2023) MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection ☆78 · Updated 9 months ago
- ☆82 · Updated 5 months ago
- InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction ☆22 · Updated 2 months ago
- Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆42 · Updated 5 months ago
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024) ☆27 · Updated last month
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 · Updated last year
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction" ☆82 · Updated 5 months ago