Senwang98 / MonoSKD
[ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient
☆30Updated 9 months ago
Related projects: ⓘ
- EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection☆65Updated 4 months ago
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆60Updated 6 months ago
- WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆75Updated 9 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆78Updated 2 months ago
- A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆45Updated 3 months ago
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆37Updated last year
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆20Updated 2 months ago
- [ECCV 2024] Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion☆12Updated last month
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆23Updated 5 months ago
- ☆19Updated 4 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆30Updated 5 months ago
- InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆22Updated 2 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆45Updated 2 weeks ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆33Updated 2 months ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆38Updated 2 weeks ago
- This repository compiles a list of papers related to Video LLM.☆16Updated 2 months ago
- ☆51Updated 10 months ago
- Detectron2 Toolbox and Benchmark for V3Det☆15Updated 3 months ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".☆43Updated 4 months ago
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies☆39Updated 4 months ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024☆22Updated 5 months ago
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆37Updated 10 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆31Updated 9 months ago
- Implementation of "PG-RCNN: Semantic Surface Point Generation for 3D Object Detection" (ICCV 2023)☆28Updated 6 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆68Updated last month
- ☆46Updated last month
- This repo contains the code for our paper MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation☆28Updated 3 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆62Updated 5 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆39Updated 3 months ago
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆156Updated 2 months ago