baichuan LLM surpervised finetune by lora
☆64Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users that are interested in baichuan_sft_lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- Baichuan2代码的逐行解析版本,适合小白☆211Sep 20, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- 文本数据增强☆15Apr 10, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆621Jan 24, 2025Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,671Jul 18, 2024Updated last year
- 本项目将会以部分裁判文书网上面案由为故意杀人罪的刑事一审判决书为原始数据,通过爬虫的方式获取数据,并通过文本分析的方式对原始的文本进行目标文本提取,并对判决书中针对被告人信息、法院认定、判决情况等部分的信息进行特征提取,并进行特征转换以构建建模变量。本项目以法院的一审判决作…☆14Sep 12, 2023Updated 2 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,113Nov 8, 2024Updated last year
- 基于rasa构建的中文任务型对话机器人,并用flask实现ui对话界面☆20Jun 20, 2019Updated 6 years ago
- ☆16Mar 12, 2024Updated 2 years ago
- ☆27Nov 25, 2025Updated 4 months ago
- A collection of models for TensorFlow Go☆12May 29, 2022Updated 3 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。☆26Jul 29, 2023Updated 2 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆299Apr 11, 2026Updated last week
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,017Apr 27, 2024Updated last year
- ☆12Aug 3, 2020Updated 5 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,933Sep 6, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- ChatGPT中文学习和实践资料汇总——LLaMA、ChatGLM等大模型的Finetune☆14Apr 17, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于深度学习识别THCHS30数据集☆14Oct 27, 2021Updated 4 years ago
- The code of "NeurJudge: A Circumstance-aware Neural Framework for Legal Judgment Prediction"(SIGIR2021))☆17Jan 3, 2024Updated 2 years ago
- An evaluation bentchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- ☆10Aug 3, 2020Updated 5 years ago
- pytorch bert 版的 multi_label_text_classification☆10Dec 28, 2019Updated 6 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated 2 years ago
- ☆19Nov 7, 2024Updated last year
- A simple text classification example using BERT and huggingface transformers☆11Sep 10, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The UC Davis Corpus of Written Spanish, L2 and Heritage Speakers☆18Sep 23, 2025Updated 6 months ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Jun 5, 2024Updated last year
- ☆19Sep 19, 2024Updated last year
- 从jieba分词到BERT-wwm,一步步带你进入中文NLP的世界☆15Sep 1, 2022Updated 3 years ago
- 天池-Datawhale 零基础入门NLP-新闻文本分类 最终榜Top10分享☆61Sep 27, 2020Updated 5 years ago
- ArWordVec is a collection of pre-trained word embedding model built from huge repository of Arabic tweets in different topics. The aim of…☆19Jul 9, 2020Updated 5 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year