Description for MV-MATH
☆15 · Jul 20, 2025 · Updated 10 months ago
Alternatives and similar repositories for MV-MATH
Users who are interested in MV-MATH are comparing it to the repositories listed below.
- A unified neural-symbolic framework for solving plane and solid geometric problems via Parse2Reason & Official repository for the CVPR 20… ☆41 · Jun 6, 2026 · Updated last week
- ☆46 · Dec 16, 2025 · Updated 6 months ago
- This is the Repository for Geometry Problem Solving Method Evaluation ☆27 · Oct 8, 2024 · Updated last year
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning" ☆30 · Mar 30, 2026 · Updated 2 months ago
- ☆12 · Dec 19, 2024 · Updated last year
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? [ICLR26] ☆40 · Jun 23, 2025 · Updated 11 months ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin… ☆10 · Feb 9, 2025 · Updated last year
- ☆34 · Feb 12, 2026 · Updated 4 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆25 · Apr 6, 2026 · Updated 2 months ago
- ☆18 · Aug 19, 2024 · Updated last year
- ☆22 · Jul 15, 2024 · Updated last year
- ☆132 · Sep 20, 2025 · Updated 8 months ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings) ☆16 · Dec 30, 2024 · Updated last year
- ☆21 · May 14, 2026 · Updated last month
- ☆80 · Jun 20, 2025 · Updated 11 months ago
- ☆31 · Jan 18, 2026 · Updated 5 months ago
- Cross-domain word representation learning ☆10 · May 23, 2015 · Updated 11 years ago
- ☆70 · Feb 4, 2026 · Updated 4 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆17 · Dec 19, 2024 · Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality ☆21 · Oct 22, 2025 · Updated 7 months ago
- Official code for CVPR2023 Boosting Video Object Segmentation via Space-time Correspondence Learning ☆24 · Jun 6, 2023 · Updated 3 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents ☆31 · Jul 31, 2025 · Updated 10 months ago
- Implementation of the following estimation algorithms in Python: Kalman Filter, Extended Kalman Filter, Unscented Kalman Filter, Cubature Kal… ☆11 · Dec 2, 2023 · Updated 2 years ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main) ☆19 · Jul 19, 2025 · Updated 11 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆188 · Jul 23, 2025 · Updated 10 months ago
- ☆15 · Jul 22, 2024 · Updated last year
- Code for Research Project TLDR ☆25 · Jul 28, 2025 · Updated 10 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 · Oct 7, 2025 · Updated 8 months ago
- ☆11 · Apr 29, 2019 · Updated 7 years ago
- ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin… ☆13 · May 5, 2024 · Updated 2 years ago
- A Self-Training Framework for Vision-Language Reasoning ☆89 · Jan 23, 2025 · Updated last year
- ☆13 · Nov 5, 2024 · Updated last year
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 10 months ago
- ☆39 · Jan 9, 2026 · Updated 5 months ago
- ☆14 · Jan 22, 2025 · Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆14 · Oct 12, 2024 · Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models ☆16 · Nov 1, 2025 · Updated 7 months ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v… ☆12 · Oct 30, 2023 · Updated 2 years ago
- Can VLMs understand students' hand-drawn math work? ☆19 · Jan 20, 2026 · Updated 4 months ago