We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.
☆64Feb 4, 2026Updated 4 months ago
Alternatives and similar repositories for VR-Bench
Users that are interested in VR-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images☆16Mar 12, 2026Updated 2 months ago
- On Policy Distillation Build on top of Verl☆69May 25, 2026Updated 2 weeks ago
- ☆18Jul 31, 2025Updated 10 months ago
- 🌟 手把手教你在论文中插入代码链接☆25Aug 2, 2025Updated 10 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆229Apr 13, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆20Oct 6, 2025Updated 8 months ago
- [MICCAI 2025] Bridging the Gap in Missing Modalities: Leveraging Knowledge Distillation and Style Matching for Brain Tumor Segmentation☆21Jul 13, 2025Updated 10 months ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- ☆20Jan 26, 2026Updated 4 months ago
- Environments by the Prime Intellect Research Team☆57Updated this week
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- ☆17Jun 10, 2025Updated last year
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Towards Accurate and Lightweight Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis☆16Mar 10, 2026Updated 3 months ago
- ChartSum is a large scale benchmark for automatic chart to text summarization☆11Jul 20, 2023Updated 2 years ago
- Modality Gap Theory☆74May 16, 2026Updated 3 weeks ago
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆16Jun 1, 2026Updated last week
- 同济大学数据挖掘课程期末作业:股票走势预测☆10Jan 11, 2021Updated 5 years ago
- 🧑🏻⚕️ 医学人工智能入门指南 Medical-AI-Guide☆32Oct 14, 2025Updated 7 months ago
- ☆15Jan 9, 2026Updated 5 months ago
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes.☆25May 29, 2026Updated last week
- The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…☆15Feb 27, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ECCV 2024] Teach CLIP to Develop a Number Sense for Ordinal Regression☆22Apr 1, 2025Updated last year
- Code for Mind the Label Shift of Augmentation-based Graph OOD generalization (LiSA) in CVPR 2023. LiSA is a model-agnostic Graph OOD fram…☆16Jun 24, 2023Updated 2 years ago
- [ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?☆53Mar 9, 2026Updated 3 months ago
- ☆218Dec 19, 2025Updated 5 months ago
- ☆21Aug 18, 2024Updated last year
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆32Feb 14, 2026Updated 3 months ago
- ☆22Jul 23, 2025Updated 10 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆29Jul 9, 2025Updated 11 months ago
- [CVPR' 26] MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts☆44Apr 27, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- An automatic prompt iteration and optimization generator suitable for any scenario☆16Jan 31, 2025Updated last year
- ☆86Feb 5, 2026Updated 4 months ago
- Visual and Embodied Concepts evaluation benchmark☆21Oct 10, 2023Updated 2 years ago
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Jun 30, 2025Updated 11 months ago
- A Split Tunneling Solution through Tailscale based on domain matching☆20Jan 8, 2026Updated 5 months ago
- A Collection of Data sets and Approaches to UAD in Brain MRI.☆24Apr 21, 2026Updated last month