The FinEval financial domain evaluation benchmark, based on quantitative fundamental methods and developed through long-term objective research, summarization, and rigorous manual screening, utilizes over 26,000 diverse question types that are highly consistent with real-world application scenarios.
☆257Jun 23, 2025Updated 8 months ago
Alternatives and similar repositories for FinEval
Users that are interested in FinEval are comparing it to the libraries listed below
Sorting:
- ☆250Dec 25, 2023Updated 2 years ago
- DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide us…☆848Nov 1, 2023Updated 2 years ago
- ☆280Jul 10, 2023Updated 2 years ago
- Fin-R1 is a large language model for complex financial reasoning developed and open-sourced with the joint efforts of the SUFE-AIFLM-Lab …☆745Mar 27, 2025Updated 11 months ago
- When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domain☆57Feb 11, 2025Updated last year
- 轩辕:度小满中文金融对话大模型☆1,300Jan 7, 2025Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning …☆834Mar 4, 2025Updated 11 months ago
- Chinese Generation Evaluation☆13Aug 14, 2023Updated 2 years ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,815Jul 27, 2025Updated 7 months ago
- FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。☆2,172May 8, 2024Updated last year
- [NeurIPS'24] Protecting Your LLMs with Information Bottleneck☆25Nov 7, 2024Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆302May 31, 2023Updated 2 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- ☆74Dec 14, 2024Updated last year
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,683Jul 18, 2024Updated last year
- Chinese Financial Assistant Benchmark for Large Language Model☆49Jul 30, 2025Updated 7 months ago
- 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)☆658Jun 30, 2023Updated 2 years ago
- CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models☆15Oct 14, 2024Updated last year
- ☆48Sep 5, 2024Updated last year
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,281Oct 16, 2024Updated last year
- 智鹿:中文消金领域对话大模型☆30Nov 12, 2023Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- 面向中文大模型价值观的评估与对齐研究☆554Jul 20, 2023Updated 2 years ago
- This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.☆12Mar 11, 2024Updated last year
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- [CVPR2019] Synthesizing Environment-Aware Activities via Activity Sketches☆13Oct 3, 2023Updated 2 years ago
- ☆50Oct 29, 2023Updated 2 years ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,688Updated this week
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,477Oct 31, 2023Updated 2 years ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 3 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆89Jul 3, 2024Updated last year
- Repository containing the website for the EMNLP 2023 conference☆17Feb 12, 2025Updated last year
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- ☆17Jun 12, 2024Updated last year