baichuan LLM surpervised finetune by lora
☆64Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users that are interested in baichuan_sft_lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- Baichuan2代码的逐行解析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话☆17Nov 30, 2023Updated 2 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 文本数据增强☆15Apr 10, 2020Updated 5 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆620Jan 24, 2025Updated last year
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,680Jul 18, 2024Updated last year
- ☆15Mar 12, 2024Updated 2 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,117Nov 8, 2024Updated last year
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆82Apr 19, 2025Updated 11 months ago
- 基于rasa构建的中文任务型对话机器人,并用flask实现ui对话界面☆20Jun 20, 2019Updated 6 years ago
- ☆27Nov 25, 2025Updated 4 months ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A collection of models for TensorFlow Go☆12May 29, 2022Updated 3 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- Using Bayesian inference to mine rule sets☆12Jan 9, 2020Updated 6 years ago
- 基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。☆26Jul 29, 2023Updated 2 years ago
- 中文转emoji☆11Dec 17, 2018Updated 7 years ago
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆296Jun 7, 2024Updated last year
- elasticsearch-notebook☆25Jul 19, 2024Updated last year
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,019Apr 27, 2024Updated last year
- 小内存、显存(低于4g)使用bert做下游任务的一个方案☆14Nov 19, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Aug 3, 2020Updated 5 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,943Sep 6, 2023Updated 2 years ago
- 最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM☆10Jul 31, 2023Updated 2 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Mar 23, 2026Updated last week
- ChatGPT中文学习和实践资料汇总——LLaMA、ChatGLM等大模型的Finetune☆14Apr 17, 2023Updated 2 years ago
- dify的插件,用于word切分等操作☆24Sep 12, 2025Updated 6 months ago
- an attempt at implementing deep learning model proposed in paper teaching robots to draw☆11Aug 13, 2021Updated 4 years ago
- The code of "NeurJudge: A Circumstance-aware Neural Framework for Legal Judgment Prediction"(SIGIR2021))☆17Jan 3, 2024Updated 2 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 项目的issue会存放我的所有blog☆19Sep 12, 2025Updated 6 months ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆59Oct 14, 2025Updated 5 months ago
- aliyun pk report 2012☆20Oct 31, 2012Updated 13 years ago
- pytorch bert 版的 multi_label_text_classification☆10Dec 28, 2019Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated 2 years ago
- ☆19Nov 7, 2024Updated last year