TongjiFinLab / CFBenchmark
Chinese Financial Assistant Benchmark for Large Language Model
☆41Updated 8 months ago
Alternatives and similar repositories for CFBenchmark
Users that are interested in CFBenchmark are comparing it to the libraries listed below
Sorting:
- deepspeed+trainer简单高效实现多卡微调大模型☆125Updated last year
- ☆65Updated last year
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆86Updated 8 months ago
- FinEval是一个中文金融领域高质量多项选择与文本问答题的集合。☆195Updated 6 months ago
- ☆97Updated last year
- Chinese Financial Assistant with Large Language Model☆58Updated 8 months ago
- ☆20Updated 10 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆160Updated last year
- 大语言模型指令调优工具(支持 FlashAttention)☆172Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆46Updated 6 months ago
- 大语言模型训练和服务调研☆37Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆30Updated 10 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆100Updated last year
- moss chat finetuning☆50Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆79Updated 10 months ago
- ☆140Updated last year
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Updated last year
- ☆109Updated 10 months ago
- llama,chatglm 等模型的微调☆88Updated 9 months ago
- ☆97Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆107Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- 中文 Instruction tuning datasets☆131Updated last year
- ChatGLM-6B添加了RLHF的实现,以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成,以及指定context推荐的RLHF的实现☆83Updated last year
- ☆202Updated last year
- ☆160Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Updated last year
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆86Updated last year
- A Bilingual Role Evaluation Benchmark for Large Language Models☆40Updated last year