ChristopheZhao / SFT_data_generation
Generates instruction-tuning data with an LLM for a specific scenario.
☆20 Updated last year
Alternatives and similar repositories for SFT_data_generation
Users who are interested in SFT_data_generation are comparing it to the repositories listed below
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud ☆68 Updated 3 weeks ago
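- As the name suggests: a hand-built RAG ☆122 Updated last year
- Chinese LLM fine-tuning (LLM-SFT), with the math instruction dataset MWP-Instruct; supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B); supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX); supports (微… ☆199 Updated last year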
- ☆78 Updated this week
- Full-parameter, LoRA, and QLoRA fine-tuning of llama3. ☆195 Updated 7 months ago
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬 ☆59 Updated 2 months ago
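- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles. ☆86 Updated 8 months ago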
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or… ☆54 Updated 3 months ago
- ☆12 Updated 2 months ago
- A Chinese self-instruct dataset built with ChatGPT ☆117 Updated 2 years ago
- ☆77 Updated 3 months ago
- ☆43 Updated 3 weeks ago
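- unify-easy-llm (ULM) aims to be a simple one-click training tool for large models, supporting different hardware such as Nvidia GPU and Ascend NPU as well as commonly used large models. ☆55 Updated 9 months ago
- Large language model applications: RAG, NL2SQL, chatbots, pretraining, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions ☆61 Updated 3 months ago
- llm-medical-data: a medical dataset for fine-tuning large models ☆107 Updated last year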
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models" ☆80 Updated 8 months ago
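- Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat ☆61 Updated last year
- First-place (top1) solution for the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge" ☆30 Updated 10 months ago
- DPO training for Tongyi Qianwen (Qwen) ☆47 Updated 7 months ago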
- ☆108 Updated 6 months ago
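- A survey of large language model training and serving ☆37 Updated last year
- Uses free large-model APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. synthetic data ☆160 Updated 5 months ago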
- ☆52 Updated 8 months ago
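- The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all your need. Uses experiments to argue that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI. ☆45 Updated 3 months ago
- Generate dialog data from documents using LLMs such as ChatGLM2 or ChatGPT ☆157 Updated last year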
- ☆56 Updated 7 months ago
- Agentic RAG R1 Framework via Reinforcement Learning ☆148 Updated last week
- ☆109 Updated 10 months ago
- A large language model for ophthalmology consultations ☆91 Updated 10 months ago
- Training an o1-like large language model from scratch. ☆13 Updated last month