HiThink-Research / MME-FinanceLinks
[MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
☆38Updated 3 weeks ago
Alternatives and similar repositories for MME-Finance
Users that are interested in MME-Finance are comparing it to the libraries listed below
Sorting:
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆166Updated this week
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated 3 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆146Updated 4 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆125Updated 4 months ago
- ☆110Updated 4 months ago
- ☆162Updated 3 weeks ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆35Updated 7 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆49Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆135Updated last year
- ☆47Updated 9 months ago
- PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining p…☆29Updated 2 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆62Updated 11 months ago
- ☆39Updated 3 months ago
- ☆46Updated 7 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆44Updated 4 months ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆129Updated 2 weeks ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆159Updated last month
- ☆163Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆184Updated last year
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆50Updated last year
- ☆84Updated last year
- ☆116Updated 3 weeks ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆120Updated 5 months ago
- [MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models☆44Updated last year
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆152Updated 10 months ago
- Official repository of MMDU dataset☆96Updated last year
- A research repo for experiments about Reinforcement Finetuning☆52Updated 7 months ago
- ☆51Updated 8 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆44Updated 4 months ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆65Updated last year