Baichuan-13B 指令微调
☆90Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-13B-Finetuning
Users that are interested in Baichuan-13B-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baichuan LLM surpervised finetune by lora☆64Jun 28, 2023Updated 2 years ago
- baichuan-7B 微调 C++ 面试大模型☆14Jul 8, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,932Sep 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Baichuan2代码的逐行解析版本,适合小白☆211Sep 20, 2023Updated 2 years ago
- Multi-Label Text Classification Based On Bert☆22Feb 28, 2023Updated 3 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- ☆12Apr 29, 2024Updated 2 years ago
- An implementation of "Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems"☆14Jul 23, 2019Updated 6 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 10 months ago
- ☆16Aug 5, 2018Updated 7 years ago
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,645Oct 24, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆621Jan 24, 2025Updated last year
- baichuan and baichuan2 finetuning and alpaca finetuning☆33Mar 10, 2025Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,108Nov 8, 2024Updated last year
- ☆14Aug 26, 2024Updated last year
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,721Oct 12, 2023Updated 2 years ago
- 结合知识图谱做的有关诗词的问答demo☆11Mar 11, 2020Updated 6 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,664Jul 18, 2024Updated last year
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆360Aug 22, 2023Updated 2 years ago
- Language Models as Semantic Indexers (ICML 2024)☆41May 2, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 中科大2022春《深度学习导论》课程资源☆10Aug 7, 2022Updated 3 years ago
- 该仓库主要描述了CCAC2023多模态对话情绪识别评测第3名的实现过程☆12Aug 11, 2024Updated last year
- An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive…☆24Dec 24, 2024Updated last year
- ☆12Jan 10, 2025Updated last year
- 演示 vllm 对中文大语言模型的神奇效果☆31Nov 4, 2023Updated 2 years ago
- 该项目专注于识别智能对话场景中的用户文本,自动判断情绪类别并给出相应的准确度。可以广泛应用于社交媒体评论情感分析、智能客服情绪分析等场景,成为情感支持工具,帮助用户 从情绪中解脱。多次Prompt提升后,GPT模型最终识别准确率高于人类Baseline水准。☆11Jul 25, 2023Updated 2 years ago
- 对qwen2.5进行微调以及知识蒸馏☆17Dec 24, 2024Updated last year
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated 2 years ago
- ☆45Sep 12, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆42Aug 16, 2023Updated 2 years ago
- ☆56Aug 12, 2024Updated last year
- Official Implementation of TETA metric from ECCV22 paper: Tracking Every Thing In The Wild☆18May 21, 2025Updated 11 months ago
- ☆12Oct 24, 2022Updated 3 years ago
- chatglm多gpu用deepspeed和☆408Jul 8, 2024Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆642Apr 9, 2024Updated 2 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year