[NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
☆40Oct 14, 2025Updated 7 months ago
Alternatives and similar repositories for SSR
Users that are interested in SSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆12May 9, 2025Updated last year
- ☆23May 8, 2025Updated last year
- Modality Gap Theory☆74May 16, 2026Updated 3 weeks ago
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆378Aug 27, 2025Updated 9 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18Sep 10, 2025Updated 9 months ago
- Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior☆16Sep 30, 2024Updated last year
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation☆526May 28, 2026Updated 2 weeks ago
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. AAAI, 2025☆16Aug 25, 2025Updated 9 months ago
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…☆14Aug 13, 2024Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- [ICCV2025] Official code repository of "CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction"☆61Aug 10, 2025Updated 10 months ago
- ☆101May 31, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆24Jan 6, 2026Updated 5 months ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆32Dec 9, 2025Updated 6 months ago
- VR-based Robot Teleoperation and Data Collection System for Humanoid Whole-Body VLA (Unitree G1)☆166Feb 17, 2026Updated 3 months ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆25Oct 22, 2025Updated 7 months ago
- ☆16Dec 25, 2025Updated 5 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆25Jan 9, 2025Updated last year
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching"