Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆108Jan 14, 2026Updated 5 months ago
Alternatives and similar repositories for N3D-VLM
Users that are interested in N3D-VLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 7 months ago
- [NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis☆76Dec 17, 2025Updated 5 months ago
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis☆34Dec 27, 2023Updated 2 years ago
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆136Sep 18, 2025Updated 8 months ago
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆29Mar 12, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆36Mar 10, 2026Updated 3 months ago
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …☆164Mar 16, 2026Updated 3 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆87Jan 5, 2026Updated 5 months ago
- ☆11Jul 17, 2024Updated last year
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)☆13Mar 27, 2023Updated 3 years ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆33Dec 12, 2025Updated 6 months ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆97Mar 18, 2026Updated 2 months ago
- ☆58Apr 8, 2026Updated 2 months ago
- ☆25Oct 13, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis☆25Sep 26, 2024Updated last year
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.☆207Jul 23, 2025Updated 10 months ago
- ☆13May 6, 2025Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆17May 14, 2025Updated last year
- A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.☆11Nov 16, 2024Updated last year
- Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning."☆18Mar 28, 2025Updated last year
- The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning"☆14Nov 28, 2022Updated 3 years ago
- [ICCV 2025] NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes☆91Oct 26, 2025Updated 7 months ago
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI☆15Apr 18, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 7 months ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆74Jan 19, 2026Updated 4 months ago
- [ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence☆320May 25, 2026Updated 3 weeks ago
- [ICLR 2026 Oral] Latent Particle World Models official repository☆108Mar 19, 2026Updated 2 months ago
- ☆16Dec 7, 2024Updated last year
- A user-friendly evaluation tool that encompasses all necessary components for boundary detection on PASCAL-Context and NYUD-v2 datasets.☆16Oct 31, 2023Updated 2 years ago
- Official PyTorch implementation of our ECCV2024 paper “Rethinking Few-shot Class-incremental Learning: Learning from Yourself”☆21Jan 12, 2025Updated last year
- [ AAAI 2026 ] The official implementation of 'MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection'☆21Mar 23, 2026Updated 2 months ago
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆89Mar 25, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Jul 30, 2021Updated 4 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆20May 29, 2025Updated last year
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆231Sep 27, 2024Updated last year
- [CVPR 2025🎉] Official implementation for paper "Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Man…☆48Mar 25, 2025Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- [CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆83May 12, 2026Updated last month
- An unofficial implementation of Tensor4D with support for the D-NeRF dataset☆13Nov 8, 2023Updated 2 years ago