HiThink-Research / MME-FinanceLinks
[MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
☆44Updated 2 weeks ago
Alternatives and similar repositories for MME-Finance
Users that are interested in MME-Finance are comparing it to the libraries listed below
Sorting:
- Scaling Preference Data Curation via Human-AI Synergy☆137Updated 6 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆172Updated 2 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 7 months ago
- ☆48Updated 11 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆61Updated 2 months ago
- ☆177Updated last month
- Paper collections of multi-modal LLM for Math/STEM/Code.☆135Updated 2 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆384Updated 5 months ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆35Updated 9 months ago
- ☆111Updated 7 months ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆130Updated last month
- ☆101Updated 2 years ago
- PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining p…☆34Updated 4 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆45Updated 7 months ago
- ☆47Updated 9 months ago
- [MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models☆50Updated last year
- ☆39Updated 6 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆45Updated 6 months ago
- A research repo for experiments about Reinforcement Finetuning☆53Updated 9 months ago
- ☆135Updated 2 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆76Updated 7 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆147Updated 3 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆92Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆141Updated last year
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources☆214Updated 4 months ago
- Official repository of MMDU dataset☆102Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Updated last year
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆164Updated 4 months ago
- Fantastic Data Engineering for Large Language Models☆93Updated last year
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆62Updated 5 months ago