Baichuan-13B 指令微调
☆90Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-13B-Finetuning
Users that are interested in Baichuan-13B-Finetuning are comparing it to the libraries listed below
Sorting:
- baichuan LLM surpervised finetune by lora☆64Jun 28, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- 实 现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 6 years ago
- Multi-Label Text Classification Based On Bert☆23Feb 28, 2023Updated 3 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,947Sep 6, 2023Updated 2 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- ☆12Apr 29, 2024Updated last year
- baichuan and baichuan2 finetuning and alpaca finetuning☆33Mar 10, 2025Updated last year
- ☆16Aug 5, 2018Updated 7 years ago
- An implementation of "Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems"☆14Jul 23, 2019Updated 6 years ago
- Baichuan2代码的逐行解析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆359Aug 22, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,638Oct 24, 2024Updated last year
- ChatGLM2-6B 全参数微调,支持多 轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆619Jan 24, 2025Updated last year
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,731Oct 12, 2023Updated 2 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆42Aug 16, 2023Updated 2 years ago
- This is an official PyTorch implementation of "Gesture2Vec: Clustering Gestures using Representation Learning Methods for Co-speech Gestu…☆26Feb 9, 2024Updated 2 years ago
- Implementation of Weakly Supervised Deep Image Hashing through Tag Embeddings☆25Jun 22, 2022Updated 3 years ago
- ☆18Jun 10, 2025Updated 9 months ago
- Language Models as Semantic Indexers (ICML 2024)☆40May 2, 2024Updated last year
- 使用多头的思想来进行命名实体识别☆34May 5, 2021Updated 4 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,681Jul 18, 2024Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,117Nov 8, 2024Updated last year
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Video retrieval from query images☆11Oct 10, 2017Updated 8 years ago
- Generalized Sentiment Classifier finetuned by KoELECTRA☆11Nov 28, 2024Updated last year
- chatglm多gpu用deepspeed和☆408Jul 8, 2024Updated last year
- LLM with LuXun (鲁迅) style☆89May 15, 2023Updated 2 years ago
- ☆14Aug 28, 2024Updated last year
- Resources for my <model-viewer> course☆11Jul 25, 2023Updated 2 years ago
- ☆11Mar 12, 2021Updated 4 years ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,019Apr 27, 2024Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆644Apr 9, 2024Updated last year
- 基于chatglm快速搭建文档问答机器人☆88May 20, 2023Updated 2 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- ☆12Oct 19, 2020Updated 5 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago