MSunDYY / SparseOccVLALinks
SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning
☆39Updated 2 weeks ago
Alternatives and similar repositories for SparseOccVLA
Users that are interested in SparseOccVLA are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Updated last year
- StreamPETR with 3dppe Extension☆51Updated 2 years ago
- ☆73Updated 5 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆41Updated last year
- [WACV 2025] PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction☆24Updated last year
- [ECCV 2024] HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras☆60Updated last year
- ☆33Updated 2 years ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 10 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆74Updated last year
- ☆22Updated last year
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆63Updated 9 months ago
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆77Updated 2 years ago
- ☆21Updated last year
- Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors☆77Updated last month
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆96Updated 3 months ago
- ☆24Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆82Updated 2 years ago
- ☆60Updated last year
- [NeurIPS 2025] OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆67Updated last year
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆21Updated 2 months ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆90Updated 2 years ago
- [ECCV 2024] Towards Stable 3D Object Detection☆49Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆71Updated last year
- DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning☆78Updated last month
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆113Updated 2 years ago
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning☆34Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆50Updated last year
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆66Updated last year
- Is Your HD Map Constructor Reliable under Sensor Corruptions?☆37Updated last year
- [AAAI 2025] ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder☆44Updated 6 months ago