CLUEbenchmark / SuperCLUE-FinLinks
中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级
☆10Updated last year
Alternatives and similar repositories for SuperCLUE-Fin
Users that are interested in SuperCLUE-Fin are comparing it to the libraries listed below
Sorting:
- KDD 2024 AQA competition 2nd place solution☆12Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- ☆15Updated last year
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆17Updated 2 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆35Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆48Updated 9 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- 逻辑回归和单层softmax的解析解☆12Updated 4 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.☆73Updated 7 months ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- 一套代码指令微调大模型☆38Updated 2 years ago
- 中文原生工业测评基准☆15Updated last year
- ☆47Updated last year
- ☆20Updated 11 months ago
- 百度QA100万数据集☆48Updated last year
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆37Updated 4 months ago
- Finetune CPM-1☆24Updated 4 years ago
- ☆81Updated last month
- Llama3开源模型中文版-全方位测评,基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE☆16Updated last year
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆60Updated last year
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Updated 2 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆10Updated 7 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Updated 2 years ago
- Code for Robust Fine-tuning (RbFT)☆15Updated 8 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated 2 years ago
- ☆19Updated last year