Baichuan-13B 指令微调
☆90Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-13B-Finetuning
Users that are interested in Baichuan-13B-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baichuan LLM surpervised finetune by lora☆64Jun 28, 2023Updated 2 years ago
- baichuan-7B 微调 C++ 面试大模型☆14Jul 8, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,933Sep 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Baichuan2代码的逐行解析版本,适合小白☆211Sep 20, 2023Updated 2 years ago
- Multi-Label Text Classification Based On Bert☆22Feb 28, 2023Updated 3 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- ☆12Apr 29, 2024Updated last year
- An implementation of "Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems"☆14Jul 23, 2019Updated 6 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 10 months ago
- ☆16Aug 5, 2018Updated 7 years ago
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,649Oct 24, 2024Updated last year
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆621Jan 24, 2025Updated last year
- baichuan and baichuan2 finetuning and alpaca finetuning☆33Mar 10, 2025Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,113Nov 8, 2024Updated last year
- ☆14Aug 26, 2024Updated last year
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,723Oct 12, 2023Updated 2 years ago
- 结合知识图谱做的有关诗词的问答demo☆11Mar 11, 2020Updated 6 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,671Jul 18, 2024Updated last year
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆360Aug 22, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Language Models as Semantic Indexers (ICML 2024)☆41May 2, 2024Updated last year
- 中科大2022春《深度学习导论》课程资源☆10Aug 7, 2022Updated 3 years ago
- An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive…☆24Dec 24, 2024Updated last year
- python人脸识别和情绪识别☆17Oct 2, 2023Updated 2 years ago
- ☆12Jan 10, 2025Updated last year
- 演示 vllm 对中文大语言模型的神奇效果☆31Nov 4, 2023Updated 2 years ago
- 该项目专注于识别智能对话场景中的用户文本,自动判断情绪类别并给出相应的准确度。可以广泛应用于社交媒体评论情感分析、智能客服情绪分析等场景,成为情感支持工具,帮助用户 从情绪中解脱。多次Prompt提升后,GPT模型最终识别准确率高于人类Baseline水准。☆11Jul 25, 2023Updated 2 years ago
- 在ChatGLM大模型上利用LoRA方法进行小参数学习,训练语料库选择中文的[alpaca-zh](https://huggingface.co/datasets/shibing624/alpaca-zh)☆26Apr 13, 2023Updated 3 years ago
- 对qwen2.5进行微调以及知识蒸馏☆17Dec 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated last year
- ☆45Sep 12, 2021Updated 4 years ago
- ☆56Aug 12, 2024Updated last year
- ☆12Oct 24, 2022Updated 3 years ago
- chatglm多gpu用deepspeed和☆408Jul 8, 2024Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆642Apr 9, 2024Updated 2 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Apr 6, 2023Updated 3 years ago