Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆102Jan 14, 2026Updated 3 months ago
Alternatives and similar repositories for N3D-VLM
Users that are interested in N3D-VLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 6 months ago
- [NeurIPS'25] HyRF: Hybrid Radiance Fields for Efficient and High-quality Novel View Synthesis☆74Dec 17, 2025Updated 4 months ago
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis☆34Dec 27, 2023Updated 2 years ago
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆65Jun 6, 2025Updated 11 months ago
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆128Sep 18, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆28Mar 12, 2026Updated last month
- ☆43Apr 8, 2026Updated 3 weeks ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆34Mar 10, 2026Updated last month
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …☆163Mar 16, 2026Updated last month
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆84Jan 5, 2026Updated 4 months ago
- ☆11Jul 17, 2024Updated last year
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)☆13Mar 27, 2023Updated 3 years ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆96Mar 18, 2026Updated last month
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆31Dec 12, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis☆24Sep 26, 2024Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 11 months ago
- OpenMMLab Rotated Object Detection Toolbox and Benchmark☆10Jun 22, 2023Updated 2 years ago
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.☆202Jul 23, 2025Updated 9 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆320Dec 14, 2024Updated last year
- ☆11May 6, 2025Updated last year
- ☆38Jan 10, 2026Updated 3 months ago
- Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning."☆18Mar 28, 2025Updated last year
- The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning"☆14Nov 28, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV 2025] NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes☆91Oct 26, 2025Updated 6 months ago
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI☆15Apr 18, 2025Updated last year
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆72Jan 19, 2026Updated 3 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 6 months ago
- [ICLR 2026 Oral] Latent Particle World Models official repository☆91Mar 19, 2026Updated last month
- [ICML 2026 Spotlight] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence☆302Updated this week
- Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports☆68Mar 15, 2026Updated last month
- Official PyTorch implementation of our ECCV2024 paper “Rethinking Few-shot Class-incremental Learning: Learning from Yourself”☆21Jan 12, 2025Updated last year
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆87Mar 25, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆19Jul 30, 2021Updated 4 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆20May 29, 2025Updated 11 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆230Sep 27, 2024Updated last year
- [CVPR 2025🎉] Official implementation for paper "Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Man…☆45Mar 25, 2025Updated last year
- [CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆78Mar 20, 2026Updated last month
- ☆97Jun 15, 2024Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year