song2yu / SIBench-VSR
A project on visual spatial reasoning tasks (SIBench).
☆25Jan 12, 2026Updated last month
Alternatives and similar repositories for SIBench-VSR
Users that are interested in SIBench-VSR are comparing it to the libraries listed below
- Self-similarity Prior Distillation for Unsupervised Remote Physiological Measurement☆10Oct 18, 2024Updated last year
- ☆16Jan 23, 2026Updated 3 weeks ago
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- ☆20Oct 15, 2025Updated 4 months ago
- The official implementation of NeurIPS 2025 D&B paper: IndustryEQA: Pushing the frontiers of Embodied Question Answering in Industrial Sc…☆11Sep 25, 2025Updated 4 months ago
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆21Nov 18, 2025Updated 2 months ago
- ☆15Sep 11, 2025Updated 5 months ago
- The official repository for "SurgNet: Self-supervised Pretraining with Semantic Consistency for Vessel and Instrument Segmentation in Sur…☆14Dec 30, 2024Updated last year
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆73Jan 26, 2026Updated 3 weeks ago
- This is the code corresponding to the paper "Resolve Domain Conflicts for Generalizable Remote Physiological Measurement." accepted in AC…☆15Apr 15, 2024Updated last year
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 7 years ago
- ☆21Sep 16, 2025Updated 5 months ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆19Feb 5, 2026Updated last week
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆25Jan 22, 2026Updated 3 weeks ago
- https://github.com/jzhang38/TinyLlama using only PyTorch☆13Jan 24, 2024Updated 2 years ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆31Feb 6, 2026Updated last week
- ☆31Dec 4, 2025Updated 2 months ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆30Jan 27, 2026Updated 3 weeks ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- ☆25Sep 18, 2025Updated 4 months ago
- MATLAB code for "PR2013 - A Comparative Study on Illumination Preprocessing in Face Recognition"☆13Aug 13, 2016Updated 9 years ago
- ☆15Feb 3, 2025Updated last year
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…☆14Jul 9, 2025Updated 7 months ago
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆26Nov 7, 2025Updated 3 months ago
- Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues (TDSC'24)☆14Jan 14, 2024Updated 2 years ago
- ☆13Apr 22, 2025Updated 9 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field☆14Nov 3, 2024Updated last year
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆26Sep 25, 2025Updated 4 months ago
- A precise and stable CFG for negative prompts, derived via guided sampling with contrastive loss.☆13Dec 27, 2024Updated last year
- Spatial Aptitude Training for Multimodal Language Models☆24Feb 8, 2026Updated last week
- Official implementation of Deep Factorized Metric Learning.☆20Jun 6, 2023Updated 2 years ago
- Official implementation of the paper: "ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation"☆46Updated this week
- Control Arduino RGB LED over serial using MATLAB App Designer☆13Jan 26, 2020Updated 6 years ago
- ☆16Jun 9, 2023Updated 2 years ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆19Jun 24, 2025Updated 7 months ago
- ☆14Oct 12, 2024Updated last year
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆30Oct 28, 2025Updated 3 months ago