(ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
☆60Sep 30, 2025Updated 7 months ago
Alternatives and similar repositories for Vision-Matters
Users that are interested in Vision-Matters are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of ECCV 2024 OOD-CV Workshop SSB Challenge (Open-Set Recognition Track) - 1st Place☆29Sep 13, 2024Updated last year
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆26Jun 27, 2025Updated 10 months ago
- 🦾 A Dual-System VLA with System2 Thinking☆143Aug 21, 2025Updated 8 months ago
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.☆53Aug 27, 2025Updated 8 months ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 8 months ago
- ☆24Aug 8, 2025Updated 8 months ago
- [ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆80Nov 10, 2025Updated 5 months ago
- Official code for "ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference" (TMLR 2024)☆17Nov 7, 2024Updated last year
- Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".☆67Nov 20, 2025Updated 5 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 3 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆84Oct 29, 2025Updated 6 months ago
- A non-official re-implementation of article "[ECCV 18] Image Inpainting for Irregular Holes Using Partial Convolutions"☆11Mar 1, 2025Updated last year
- Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"☆61Mar 17, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 7 months ago
- BatteryBERT: A Pre-trained Language Model for Battery Database Enhancement☆35Sep 6, 2022Updated 3 years ago
- ☆20Apr 16, 2025Updated last year
- [IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…☆27Nov 11, 2025Updated 5 months ago
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 11 months ago
- ☆13Jun 7, 2022Updated 3 years ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆188Jun 5, 2025Updated 10 months ago
- ☆12Jun 13, 2025Updated 10 months ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Feb 5, 2022Updated 4 years ago
- ☆10Jul 11, 2022Updated 3 years ago
- Code for Heima☆59Apr 21, 2025Updated last year
- ☆40Jul 3, 2025Updated 10 months ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- SAEval: A benchmark for sentiment analysis to evaluate the model's performance on various subtasks.☆14Apr 29, 2024Updated 2 years ago
- [ECCV 2024] Official Implementation of "Disentangling Masked Autoencoders for Unsupervised Domain Generalization"☆14Jul 31, 2024Updated last year
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 10 months ago
- [MM 2023 Oral] Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation☆17Jan 10, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Pytorch implementation of NeuralWalker (ICLR 2025)☆39Jun 25, 2025Updated 10 months ago
- A recipe app built with React and Ruby on Rails☆19Aug 5, 2019Updated 6 years ago
- ☆14Feb 26, 2025Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57May 28, 2025Updated 11 months ago
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 9 months ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆22Sep 21, 2024Updated last year