对llama3进行全参微调、lora微调以及qlora微调。
☆220Oct 4, 2024Updated last year
Alternatives and similar repositories for Llama3.1-Finetuning
Users that are interested in Llama3.1-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated 2 years ago
- 大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调☆646May 26, 2025Updated last year
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆22Mar 12, 2026Updated 3 months ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆32May 17, 2024Updated 2 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,778Dec 12, 2023Updated 2 years ago
- LLM+RAG for QA☆23Jan 15, 2024Updated 2 years ago
- 人工智能实验五:多模态情感分类☆16Jul 14, 2022Updated 3 years ago
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Apr 10, 2026Updated 2 months ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆77Nov 26, 2024Updated last year
- ☆37Feb 16, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆19May 23, 2025Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆28Jul 23, 2024Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆36May 15, 2026Updated last month
- Alpaca Chinese Dataset -- 中文指令微调数据集☆221Oct 6, 2024Updated last year
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆51Aug 6, 2024Updated last year
- 以InternLM2-chat-7为基座模型,以常用中药等为数据集,微调的大模型。中医聊天小助手。☆18Feb 29, 2024Updated 2 years ago
- ☆26Aug 21, 2024Updated last year
- dpo算法实现☆53Jun 12, 2024Updated 2 years ago
- 简单易懂的LLaMA微调指南。☆413Jul 5, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆18May 28, 2024Updated 2 years ago
- 陶弘景中医药大模型,包括命名实体识别,关系抽取,知识图谱构建,大模型增量微调,RAG☆19Jul 28, 2025Updated 10 months ago
- ☆22Oct 22, 2024Updated last year
- 中文文本摘要生成模型☆21Jul 29, 2022Updated 3 years ago
- Multi-Task instruction-tuned LLaMA☆14May 5, 2023Updated 3 years ago
- 大模型微调工具集合☆26Mar 15, 2024Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆22Jun 22, 2023Updated 2 years ago
- qwen models finetuning☆107Mar 9, 2025Updated last year
- 使用UniLM实现中文文本摘要☆43Mar 25, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation☆19Dec 11, 2024Updated last year
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.☆18Dec 27, 2022Updated 3 years ago
- 结合知识图谱做的有关诗词的问答demo☆11Mar 11, 2020Updated 6 years ago
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".☆21Mar 9, 2023Updated 3 years ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- A toolkit for modeling and simulation of cloud-native applications.☆16Aug 4, 2025Updated 10 months ago
- Predicting new perovskites with ensemble Machine Learning algorithms☆16Nov 8, 2025Updated 7 months ago