baichuan LLM surpervised finetune by lora
☆63Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users that are interested in baichuan_sft_lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baichuan-13B 指令微调☆88Jul 14, 2023Updated 2 years ago
- Baichuan2代码的逐行解析版本,适合小白☆211Sep 20, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆22Jun 22, 2023Updated 2 years ago
- Deepseek-r1复现科普与资源汇总☆22Mar 5, 2025Updated last year
- chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话☆17Nov 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 文本数据增强☆15Apr 10, 2020Updated 6 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆620Jan 24, 2025Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,658Jul 18, 2024Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,102Nov 8, 2024Updated last year
- 基于rasa构建的中文任务型对话机器人,并用flask实现ui对话界面☆19Jun 20, 2019Updated 6 years ago
- ☆27May 12, 2026Updated 2 weeks ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆109Jul 19, 2023Updated 2 years ago
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆307May 3, 2026Updated 3 weeks ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,015Apr 27, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 小内存、显存(低于4g)使用bert做下游任务的一个方案☆14Nov 19, 2019Updated 6 years ago
- 臺華平行新聞語料庫☆16Jul 2, 2018Updated 7 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,931Sep 6, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last month
- ChatGPT中文学习和实践资料汇总——LLaMA、ChatGLM等大模型的Finetune☆14Apr 17, 2023Updated 3 years ago
- dify的插件,用于word切分等操作☆25Sep 12, 2025Updated 8 months ago
- 基于深度学习识别THCHS30数据集☆14Oct 27, 2021Updated 4 years ago
- The code of "NeurJudge: A Circumstance-aware Neural Framework for Legal Judgment Prediction"(SIGIR2021))☆18Jan 3, 2024Updated 2 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated 2 years ago
- A simple text classification example using BERT and huggingface transformers☆11Sep 10, 2020Updated 5 years ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆21Jul 19, 2023Updated 2 years ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆13Jun 5, 2024Updated last year
- ☆19Sep 19, 2024Updated last year
- Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809☆22Oct 22, 2024Updated last year
- 雪球网评论数据爬取☆10Sep 27, 2019Updated 6 years ago
- My pytorch implementation of the model described in the paper **Hierarchical Attention Networks for Document Classification** [paper](htt…☆10Mar 22, 2019Updated 7 years ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆19May 23, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision☆20Mar 28, 2024Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- Official repository Flash Local Linear Attention☆23Apr 23, 2026Updated last month
- ☆13Mar 16, 2022Updated 4 years ago
- ☆21Oct 30, 2024Updated last year
- PyTorch实现的多标签的文本分类☆14Apr 14, 2019Updated 7 years ago
- Finetune LLaMA-7B with Chinese instruction datasets☆136May 8, 2023Updated 3 years ago