wp931120 / baichuan_sft_loraView external linksLinks
baichuan LLM surpervised finetune by lora
☆64Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users that are interested in baichuan_sft_lora are comparing it to the libraries listed below
Sorting:
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- ☆15Mar 12, 2024Updated last year
- 文本数据增强☆15Apr 10, 2020Updated 5 years ago
- chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话☆17Nov 30, 2023Updated 2 years ago
- Baichuan2代码的逐行解析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆619Jan 24, 2025Updated last year
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated last year
- 基于rasa构建的中文任务型对话机器人,并用flask实现ui对话界面☆20Jun 20, 2019Updated 6 years ago
- MFIN7036 NLP Course Project☆10Jul 25, 2024Updated last year
- 基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。☆26Jul 29, 2023Updated 2 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,686Jul 18, 2024Updated last year
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆289Jun 7, 2024Updated last year
- ☆34Sep 14, 2024Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,118Nov 8, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- 基于WebSocket协议实现实时弹幕信息爬取与信息通信。通过MaxKB容器训练直播互动模型,具备智能互动能力,通过微调预训练的语言模型来适应特定的直播场景需求,提升数字人的交互体验。基于TTS和Wav2lip开发语音克隆和唇形同步算法,通过预训练数字人模型的方式压缩生成时…☆13Oct 7, 2024Updated last year
- LLM-powered chatbot app to enhance accessibility to knowledge contained within PDFs☆14May 6, 2025Updated 9 months ago
- ☆11Aug 20, 2025Updated 5 months ago
- The practitioner's guide to high-speed business automation at enterprise scale using Appian☆11Jan 18, 2023Updated 3 years ago
- multi_gpu_infer 多gpu预测 multiprocessing or subprocessing☆12Mar 24, 2020Updated 5 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- Logo design made plain.☆11Jun 9, 2018Updated 7 years ago
- ☆20Sep 11, 2025Updated 5 months ago
- Spam SMS Detector, Flutter Application☆11Oct 14, 2021Updated 4 years ago
- finetune llama2 with traditional chinese dataset