[NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》
☆596Dec 29, 2025Updated 2 months ago
Alternatives and similar repositories for SD-VLM
Users that are interested in SD-VLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Oct 15, 2025Updated 5 months ago
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆20Dec 6, 2024Updated last year
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆27Aug 7, 2025Updated 7 months ago
- Code for paper "Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions". Coming soon.☆33Jul 31, 2025Updated 7 months ago
- ☆22Aug 17, 2024Updated last year
- [ICCV 2025] Official implementation of "AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving"☆37Jul 15, 2025Updated 8 months ago
- [ICCAD 2024] SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator☆11Feb 3, 2026Updated last month
- MEt3R: Measuring Multi-View Consistency in Generated Images☆163Feb 23, 2026Updated last month
- ☆18Sep 25, 2025Updated 5 months ago
- Dexterous World Models☆75Feb 22, 2026Updated last month
- Minute-long video generation at 24FPS.☆59Feb 2, 2026Updated last month
- ☆14Feb 26, 2025Updated last year
- ☆11May 15, 2024Updated last year
- ☆12Jun 11, 2025Updated 9 months ago
- ☆14Sep 11, 2025Updated 6 months ago
- YOLO-NAS for ROS 2☆14Jun 5, 2023Updated 2 years ago
- Unlocking Iterative Reasoning for Any Image Editor☆99Jan 18, 2026Updated 2 months ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆24Mar 4, 2026Updated 2 weeks ago
- Data augmentation using OpenCV☆11Jan 12, 2017Updated 9 years ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆181Mar 10, 2026Updated 2 weeks ago
- [AAAI 2026] Official code for "Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-L…☆14Updated this week
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆92Mar 12, 2026Updated last week
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆16Mar 31, 2025Updated 11 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆18Aug 30, 2024Updated last year
- The official implementation of NeurlPS 2025 D&B paper: IndustryEQA: Pushing the frontiers of Embodied Question Answering in Industrial Sc…☆12Sep 25, 2025Updated 5 months ago
- prototyping stuff☆14Aug 17, 2025Updated 7 months ago
- [NeurIPS 2025] AutoSeg3D, online real-time 3D segmentation as instance tracking with long-short term query memory for embodied perception☆47Dec 18, 2025Updated 3 months ago
- ☆20Apr 12, 2025Updated 11 months ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆17Oct 4, 2025Updated 5 months ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- ☆80Nov 4, 2025Updated 4 months ago
- Logging in with Scrapy☆14Jan 26, 2018Updated 8 years ago
- [CVPR 2023] Segmenting objects in videos without human annotations 🤯: Official implementation for Bootstrapping Objectness from Videos b…☆38Nov 23, 2023Updated 2 years ago
- [CVPR 2026] Thinking in 360°: Humanoid Visual Search in the Wild☆139Mar 3, 2026Updated 2 weeks ago
- Yahboom ROS Transbot Robot with Lidar Depth camera support MoveIt 3D mapping for Raspberry Pi☆13Aug 22, 2025Updated 7 months ago
- Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)☆14Mar 2, 2026Updated 3 weeks ago
- ☆12Apr 18, 2025Updated 11 months ago
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆14Jun 14, 2024Updated last year