Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆99Jan 14, 2026Updated 3 months ago
Alternatives and similar repositories for N3D-VLM
Users that are interested in N3D-VLM are comparing it to the libraries listed below.
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 5 months ago
- [NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild☆126Jan 6, 2026Updated 3 months ago
- [NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis☆73Dec 17, 2025Updated 3 months ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆32Mar 10, 2026Updated last month
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis☆34Dec 27, 2023Updated 2 years ago
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆127Sep 18, 2025Updated 6 months ago
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆27Mar 12, 2026Updated last month
- ☆41Apr 8, 2026Updated last week
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …☆162Mar 16, 2026Updated last month
- Code of Strips as Tokens: Artist Mesh Generation with Native UV Segmentation. ACM Transactions on Graphics (SIGGRAPH 2026)☆98Updated this week
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆84Jan 5, 2026Updated 3 months ago
- ☆11Jul 17, 2024Updated last year
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)☆13Mar 27, 2023Updated 3 years ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆93Mar 18, 2026Updated 3 weeks ago
- XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis☆23Sep 26, 2024Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 11 months ago
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.☆198Jul 23, 2025Updated 8 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆317Dec 14, 2024Updated last year
- ☆38Jan 10, 2026Updated 3 months ago
- Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning."☆17Mar 28, 2025Updated last year
- The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning"☆14Nov 28, 2022Updated 3 years ago
- [ICLR 2026 Oral] Latent Particle World Models official repository☆81Mar 19, 2026Updated 3 weeks ago
- [ICCV 2025] NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes☆91Oct 26, 2025Updated 5 months ago
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI☆15Apr 18, 2025Updated 11 months ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆71Jan 19, 2026Updated 2 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 5 months ago
- Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence☆295Mar 31, 2026Updated 2 weeks ago
- [AAAI 2026] The official implementation of 'MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection'☆18Mar 23, 2026Updated 3 weeks ago
- ☆16Dec 7, 2024Updated last year
- A user-friendly evaluation tool that encompasses all necessary components for boundary detection on PASCAL-Context and NYUD-v2 datasets.☆15Oct 31, 2023Updated 2 years ago
- Official PyTorch implementation of our ECCV2024 paper “Rethinking Few-shot Class-incremental Learning: Learning from Yourself”☆21Jan 12, 2025Updated last year
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆84Mar 25, 2026Updated 3 weeks ago
- ☆19Jul 30, 2021Updated 4 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆19May 29, 2025Updated 10 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆230Sep 27, 2024Updated last year
- Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆74Mar 20, 2026Updated 3 weeks ago
- [CVPR 2025🎉] Official implementation for paper "Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Man…☆45Mar 25, 2025Updated last year
- ☆97Jun 15, 2024Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year