Baichuan LLM supervised fine-tuning with LoRA
☆64Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users who are interested in baichuan_sft_lora are comparing it to the libraries listed below.
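- Baichuan-13B instruction fine-tuning☆90Jul 14, 2023Updated 2 years ago
- A line-by-line walkthrough of the Baichuan2 code, suitable for beginners☆211Sep 20, 2023Updated 2 years ago
- Instruction fine-tuning of the baichuan-7B large model using QLoRA.☆23Jun 22, 2023Updated 2 years ago
- Deepseek-r1 reproduction primer and resource roundup☆22Mar 5, 2025Updated last year
- chatglm3-6b, fine-tuning/LORA/inference/single-node multi-GPU/deepspeed/multi-turn dialogue support☆17Nov 30, 2023Updated 2 years ago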
- Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- Text data augmentation☆15Apr 10, 2020Updated 6 years ago
- Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment for large models.☆621Jan 24, 2025Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,664Jul 18, 2024Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,108Nov 8, 2024Updated last year
- Notes on pretraining an LLM from scratch, SFT, RLHF, and DPO, plus interview questions☆21Sep 2, 2024Updated last year
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- ☆27Nov 25, 2025Updated 5 months ago
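- End-to-end development pipeline for a qwen3-based medical LLM: 0. tokenizer training 1. continued pretraining 2. fine-tuning 3. reinforcement learning 4. quantization 5. distillation 6. evaluation 7. LoRA model merging 8. serving 9. deployment☆43Jan 3, 2026Updated 4 months ago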
- DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.☆14Mar 9, 2022Updated 4 years ago
- ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune☆109Jul 19, 2023Updated 2 years ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
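- Fine-tuning based on ChatGLM2-6B, including full-parameter, parameter-efficient, and quantization-aware training; supports instruction fine-tuning, multi-turn dialogue fine-tuning, and more.☆26Jul 29, 2023Updated 2 years ago
- End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding, with models such as SA (MHA), LSTM, CNN, and DFSMN☆15Jun 4, 2021Updated 4 years ago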
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated 2 years ago
- Deepspeed, LLM, Medical_Dialogue, medical large models, pretraining, fine-tuning☆307Updated this week
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,015Apr 27, 2024Updated 2 years ago
- ☆17Mar 1, 2019Updated 7 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,932Sep 6, 2023Updated 2 years ago
- The largest open-source Chinese Q&A dataset, supporting Chinese LLMs☆10Jul 31, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last week
- A collection of Chinese-language ChatGPT learning and practice materials: fine-tuning LLaMA, ChatGLM, and other large models☆14Apr 17, 2023Updated 3 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- A dify plugin for operations such as word splitting☆25Sep 12, 2025Updated 7 months ago
- Deep-learning-based recognition on the THCHS30 dataset☆14Oct 27, 2021Updated 4 years ago
- The code of "NeurJudge: A Circumstance-aware Neural Framework for Legal Judgment Prediction" (SIGIR2021)☆18Jan 3, 2024Updated 2 years ago
- An evaluation benchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- ☆10Aug 3, 2020Updated 5 years ago
- This project's issues hold all of my blog posts☆19Sep 12, 2025Updated 7 months ago
- PyTorch BERT version of multi_label_text_classification☆10Dec 28, 2019Updated 6 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated 2 years ago
- ☆19Nov 7, 2024Updated last year
- A simple text classification example using BERT and huggingface transformers☆11Sep 10, 2020Updated 5 years ago