vita-epfl / VoxDet
[NeurIPS 25 Spotlight] VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
☆61 · Oct 16, 2025 · Updated 4 months ago
Alternatives and similar repositories for VoxDet
Users who are interested in VoxDet are comparing it to the repositories listed below:
- [CVPR 2025] Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction · ☆32 · Dec 1, 2025 · Updated 2 months ago
- Undistorted Depth Support for ScanNet++ · ☆17 · Dec 8, 2023 · Updated 2 years ago
- [ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction · ☆39 · Dec 1, 2025 · Updated 2 months ago
- LiDAR-based Traversability estimation in unstructured environments (RA-L 2024) · ☆17 · Jun 24, 2025 · Updated 7 months ago
- ☆65 · Jul 13, 2025 · Updated 7 months ago
- [ICASSP2025] ConcealGS: Conceal Implicit Information in 3D Gaussian Splatting · ☆20 · Jan 22, 2025 · Updated last year
- Out-of-Distribution Semantic Occupancy Prediction · ☆20 · Oct 22, 2025 · Updated 3 months ago
- [ECCV 2024, TPAMI 2025] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene C… · ☆50 · Dec 31, 2025 · Updated last month
- [ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler · ☆26 · Aug 7, 2025 · Updated 6 months ago
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion · ☆24 · Feb 5, 2026 · Updated last week
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering · ☆43 · Oct 15, 2025 · Updated 4 months ago
- Satellite-Ground Fusion for 3D Semantic Scene Completion · ☆28 · Sep 8, 2025 · Updated 5 months ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras · ☆34 · Sep 22, 2024 · Updated last year
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction · ☆52 · Oct 27, 2025 · Updated 3 months ago
- ☆44 · May 10, 2025 · Updated 9 months ago
- [NeurIPS'25] EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis · ☆57 · Oct 21, 2025 · Updated 3 months ago
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking" · ☆54 · Jun 23, 2025 · Updated 7 months ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems · ☆137 · Feb 1, 2026 · Updated 2 weeks ago
- Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving · ☆64 · Jul 25, 2025 · Updated 6 months ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction · ☆73 · Aug 1, 2025 · Updated 6 months ago
- [ICCV 2025] TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction. · ☆16 · Jan 31, 2026 · Updated 2 weeks ago
- Annotated dataset of quadrotor Eagle for object detection of UAVs · ☆14 · Apr 4, 2022 · Updated 3 years ago
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion · ☆12 · Jan 14, 2026 · Updated last month
- ☆50 · Oct 26, 2025 · Updated 3 months ago
- ☆540 · Jul 29, 2024 · Updated last year
- [IROS2023] Calibration-free BEV Representation for Infrastructure Perception · ☆42 · May 31, 2023 · Updated 2 years ago
- [MICCAI 2024] EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting · ☆41 · Jul 9, 2024 · Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models" · ☆13 · Aug 6, 2024 · Updated last year
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocab… · ☆28 · Jan 26, 2026 · Updated 3 weeks ago
- Estimate depth from surface normal. · ☆12 · Aug 14, 2020 · Updated 5 years ago
- MGDLoss based yolov11 Knowledge Distillation · ☆13 · Oct 27, 2025 · Updated 3 months ago
- Inferring distributions over depth from a single image, IROS 2019 · ☆39 · Jul 21, 2023 · Updated 2 years ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving · ☆52 · Feb 14, 2025 · Updated last year
- Dur360BEV: (ICRA 2025) A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving · ☆23 · Feb 2, 2026 · Updated 2 weeks ago
- Efficient Adversarial Attack Strategy Against 3D Object Detection in Autonomous Driving Systems · ☆38 · Oct 7, 2025 · Updated 4 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. · ☆14 · May 26, 2025 · Updated 8 months ago
- "BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks" · ☆13 · May 10, 2024 · Updated last year
- ☆11 · Nov 30, 2023 · Updated 2 years ago
- logit lens for VGGT · ☆26 · Dec 2, 2025 · Updated 2 months ago