[MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
☆44 · Jan 8, 2026 · Updated 3 months ago
Alternatives and similar repositories for MME-Finance
Users that are interested in MME-Finance are comparing it to the libraries listed below.
- BizFinBench.v2: A Unified Offline–Online Bilingual Benchmark for Expert-Level Financial Capability Evaluation of LLMs ☆43 · Jan 29, 2026 · Updated 3 months ago
- General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models. ☆50 · Updated this week
- The implementation of LLMTreeRec ☆14 · Dec 9, 2024 · Updated last year
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti… ☆15 · Jun 5, 2024 · Updated last year
- ☆16 · Nov 12, 2024 · Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆82 · Jul 31, 2023 · Updated 2 years ago
- Bayesian scaling laws for in-context learning. ☆15 · Mar 12, 2025 · Updated last year
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities ☆20 · May 27, 2025 · Updated 11 months ago
- ☆10 · May 8, 2024 · Updated last year
- Weakly Supervised Temporal Anomaly Segmentation ☆16 · Nov 24, 2021 · Updated 4 years ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms ☆25 · Dec 21, 2025 · Updated 4 months ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions" ☆12 · Aug 26, 2023 · Updated 2 years ago
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion ☆125 · Mar 12, 2026 · Updated last month
- Collections of RLxLM experiments using minimal codes ☆14 · Feb 17, 2025 · Updated last year
- A Business-Driven Real-World Financial Benchmark for Evaluating LLMs ☆166 · Jan 9, 2026 · Updated 3 months ago
- ☆11 · Oct 2, 2023 · Updated 2 years ago
- Answering Ambiguous Questions via Iterative Prompting ☆14 · May 25, 2024 · Updated last year
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model ☆17 · Feb 13, 2025 · Updated last year
- P2P Social network ☆16 · May 25, 2015 · Updated 10 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes ☆14 · Jul 1, 2025 · Updated 9 months ago
- A Mechanistic-Interpretability study that finds the structural dynamics of Large Language Models under fine-tuning. ☆16 · May 30, 2025 · Updated 11 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency ☆138 · Aug 5, 2025 · Updated 8 months ago
- ☆16 · Sep 27, 2023 · Updated 2 years ago
- curated collections of modules ☆21 · Aug 12, 2015 · Updated 10 years ago
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte… ☆17 · Jan 15, 2024 · Updated 2 years ago
- ☆10 · Jul 24, 2018 · Updated 7 years ago
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning ☆16 · May 24, 2025 · Updated 11 months ago
- [NeurIPS 24] Implementation of "Advancing Video Anomaly Detection: A Concise Review and a New Dataset". ☆22 · Apr 16, 2025 · Updated last year
- Chinese Financial Assistant Benchmark for Large Language Model ☆52 · Jul 30, 2025 · Updated 9 months ago
- Immediate put/get for any abstract-chunk-store compliant store ☆20 · Jan 30, 2023 · Updated 3 years ago
- ☆32 · Dec 10, 2025 · Updated 4 months ago
- SenseNova-U series: Native Unified Paradigm with NEO-Unify from the First Principles ☆533 · Updated this week
- Convert an abstract-chunk-store compliant store into a readable or writable stream ☆26 · Jul 4, 2022 · Updated 3 years ago
- ☆15 · Jan 16, 2024 · Updated 2 years ago
- ☆27 · Jul 6, 2024 · Updated last year
- ☆13 · Dec 9, 2024 · Updated last year
- Pattern Recognition, taught by Prof. Liu Chenglin at the University of Chinese Academy of Sciences ☆12 · Jan 7, 2021 · Updated 5 years ago
- FileGram: Grounding Agent Personalization in File-System Behavioral Traces ☆64 · Apr 12, 2026 · Updated 2 weeks ago
- ☆17 · Feb 4, 2025 · Updated last year