YunwenTechnology / Chinese-Data-Distill-From-R1Links

中文基于满血DeepSeek-R1蒸馏数据集

☆62

Alternatives and similar repositories for Chinese-Data-Distill-From-R1

Users that are interested in Chinese-Data-Distill-From-R1 are comparing it to the libraries listed below

Sorting:

Chinese-Tiny-LLM / Chinese-Tiny-LLM
☆232Updated last year
open-chinese / alpaca-chinese-dataset
Alpaca Chinese Dataset -- 中文指令微调数据集
☆215Updated last year
zejunwang1 / LLMTuner
大语言模型指令调优工具（支持 FlashAttention）
☆178Updated last year
CLUEbenchmark / SuperCLUE-RAG
中文原生检索增强生成测评基准
☆123Updated last year
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆153Updated 2 years ago
thu-coai / CritiqueLLM
☆147Updated last year
xubuvd / LLMs
专注于中文领域大语言模型，落地到某个行业某个领域，成为一个行业大模型、公司级别或行业级别领域大模型。
☆123Updated 7 months ago
percent4 / llm_math_solver
本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。
☆95Updated last year
sufengniu / RefGPT
☆163Updated 2 years ago
HIT-SCIR-SC / QiaoBan
☆237Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
hengjiUSTC / learn-llm
☆115Updated 11 months ago
CASIA-LM / ChineseWebText
☆179Updated last year
RUC-GSAI / Yulan-GARDEN
Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆83Updated last year
modelscope / easydistill
a toolkit on knowledge distillation for large language models
☆171Updated last week
zhoujx4 / python-node-deepresearch
deepResearch
☆72Updated 5 months ago
HIT-SCIR / huozi
活字通用大模型
☆392Updated last year
yongzhuo / LLM-SFT
中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…
☆211Updated last year
twang2218 / vocab-coverage
语言模型中文认知能力分析
☆236Updated 2 years ago
OpenLMLab / ChatZoo
Light local website for displaying performances from different chat models.
☆87Updated last year
CLUEbenchmark / SuperCLUE-Agent
SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准
☆92Updated last year
gmftbyGMFTBY / science-llm
A large-scale language model for scientific domain, trained on redpajama arXiv split
☆136Updated last year
SupritYoung / RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
☆254Updated 2 years ago
DA-southampton / RedGPT
☆68Updated 2 years ago
morecry / CharacterEval
☆267Updated 4 months ago
THUDM / AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
☆413Updated last year
X-PLUG / Multi-LLM-Agent
☆232Updated last year
xv44586 / Chinese-instruction-datasets
中文 Instruction tuning datasets
☆137Updated last year
mutonix / RefGPT
☆98Updated last year
tjunlp-lab / M3KE
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
☆102Updated 2 years ago