Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection
☆89Mar 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for VGGT-Det-CVPR2026
Users that are interested in VGGT-Det-CVPR2026 are comparing it to the libraries listed below.
- ☆13Jun 4, 2025Updated 9 months ago
- Code for NeurIPS 2024 work "MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps"☆17Dec 11, 2024Updated last year
- CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs (CVPR2024)☆17Jun 14, 2024Updated last year
- Official code for ECCV2024 paper: GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal☆104Nov 25, 2025Updated 4 months ago
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocab…☆31Feb 19, 2026Updated last month
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- ☆90Updated this week
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆62Aug 23, 2024Updated last year
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"☆53Oct 24, 2024Updated last year
- Awesome-4D-Radar☆12Feb 17, 2024Updated 2 years ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆192Sep 7, 2025Updated 6 months ago
- [ICLR 2026] Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos☆28Jan 26, 2026Updated last month
- [CVPR 2026] "GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"☆68Dec 17, 2025Updated 3 months ago
- One4D: Unified 4D Generation and Reconstruction☆92Dec 2, 2025Updated 3 months ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆45Apr 21, 2024Updated last year
- The official code for ICML 2024 "FedREDefense: Defending against Model Poisoning Attacks for Federated Learning using Model Update Recons…☆29Jun 6, 2024Updated last year
- [Arxiv'25] SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images☆47Oct 18, 2025Updated 5 months ago
- Visual Localization with an image sequence. 3DV Project @ ETH Zurich, 2022.☆20Jul 20, 2022Updated 3 years ago
- [CVPR 2025] GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping☆44Oct 22, 2025Updated 5 months ago
- Codes for Switch-NeRF (ICLR 2023)☆211Aug 25, 2025Updated 6 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated last year
- This is the implementation of the paper "ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving" (ICLR 2026)☆26Feb 5, 2026Updated last month
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Mar 25, 2024Updated 2 years ago
- ☆34Aug 26, 2025Updated 6 months ago
- A Unified Perspective-to-Equirectangular Visual Place Recognition Framework☆21Dec 19, 2025Updated 3 months ago
- Awesome AI tools☆12Apr 21, 2023Updated 2 years ago
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆127Sep 18, 2025Updated 6 months ago
- Vision-Language Global Localization (VLG-Loc) is a global localization method that uses camera images and a human-readable labeled footpr…☆38Dec 24, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] The official repo for the conference paper "MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized …☆39Dec 13, 2025Updated 3 months ago
- ☆13Apr 5, 2023Updated 2 years ago
- ☆13Sep 14, 2022Updated 3 years ago
- ☆26Jan 20, 2025Updated last year
- An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.☆381Jun 19, 2025Updated 9 months ago
- [CVPR2023] NeRF-RPN: A general framework for object detection in NeRFs☆236Mar 16, 2025Updated last year
- Unsupervised multi-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆66Oct 27, 2024Updated last year
- Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Dete…☆220Updated this week
- pytorch implementation of "Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time"☆48Jan 27, 2026Updated last month
- [ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆65Sep 3, 2025Updated 6 months ago