yongzhuo/LLaMA3-SFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yongzhuo/LLaMA3-SFT)

yongzhuo / LLaMA3-SFT

LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)

☆34

Alternatives and similar repositories for LLaMA3-SFT

Users that are interested in LLaMA3-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChenXingLing / Qwen-fine-tune
View on GitHub
Qwen1.5大模型微调、基于PEFT框架LoRA微调，在数据集HC3-Chinese上实现文本分类。
☆12Jun 29, 2024Updated 2 years ago
yongzhuo / gemma-sft
View on GitHub
Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆32May 17, 2024Updated 2 years ago
Phoenix8215 / build_neural_network_from_scratch_CPP
View on GitHub
Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.
☆11Jul 27, 2024Updated 2 years ago
ramsrigouthamg / BERT_generate_grammar_MCQ_from_news_article
View on GitHub
Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.
☆13Oct 2, 2019Updated 6 years ago
tt2615 / CFPRec
View on GitHub
☆10Aug 16, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JennyXieJiayi / STAR-HiT
View on GitHub
The implementation of STAR-HiT.
☆11Oct 18, 2023Updated 2 years ago
GioGioBond / NBCEonChatGLM6b
View on GitHub
(NBCE)Naive Bayes-based Context Extension on ChatGLM-6b
☆15Jun 7, 2023Updated 3 years ago
icmpnorequest / ICASSP2024_ASTHL
View on GitHub
The official PyTorch implementation for 2024-ICASSP-Adaptive Spatial-Temporal Hypergraph Fusion Learning for Next POI Recommendation
☆13Sep 8, 2024Updated last year
taishan1994 / Llama3.1-Finetuning
View on GitHub
对llama3进行全参微调、lora微调以及qlora微调。
☆221Oct 4, 2024Updated last year
Yifang-Qin / DisenPOI
View on GitHub
The pytorch implementation of DisenPOI.
☆25Oct 18, 2023Updated 2 years ago
chatopera / efaqa-corpus-raw
View on GitHub
Emotional First Aid Raw Dataset, 心理咨询问答原始语料库
☆23Mar 6, 2026Updated 4 months ago
tianchiguaixia / ocr_recognition
View on GitHub
微调阿里开源的文字检测模型，利用合合识别返回的OCR结果作为初始训练数据，对模型进行优化训练，使其更加适应1万张图片的具体场景，提高文字识别的精度。
☆10Dec 9, 2024Updated last year
yongzhuo / Llama2-SFT
View on GitHub
Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理
☆27Jul 26, 2023Updated 3 years ago
S1s-Z / NOVA
View on GitHub
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
☆21Jul 23, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Woody5962 / Ranked-List-Truncation
View on GitHub
A framework for Ranked List Truncation, including the implementation of multiple existing deep models, such as BiCut、Choopy and AttnCut. …
☆14May 7, 2022Updated 4 years ago
he-h / ST-MoE-BERT
View on GitHub
This repository contains the code for the paper "ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mo…
☆16Feb 20, 2025Updated last year
AmourWaltz / UAlign
View on GitHub
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15Mar 25, 2025Updated last year
caihao20 / zh_correct_pinyin
View on GitHub
中文纠错-使用拼音树及编辑距离
☆13Jul 19, 2019Updated 7 years ago
agsarthak / Goal-oriented-Dialogue-Systems
View on GitHub
Applying Deep Reinforcement Learning for dialogue generation. aka chatbot
☆13Apr 30, 2017Updated 9 years ago
NoManNayeem / Langchain_CrewAI_Gemini-AI_Agents
View on GitHub
Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.
☆14Mar 24, 2024Updated 2 years ago
liunian-Jay / AgenticRAG-RL
View on GitHub
A minimal implementation of Agentic RAG using GRPO
☆17Jun 11, 2025Updated last year
Ren-Research / LOMAR
View on GitHub
[ICML 2023] Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
☆11Aug 9, 2023Updated 2 years ago
bebr2 / RACE
View on GitHub
Code for RACE.
☆15Nov 12, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
XMUDeepLIT / QGC
View on GitHub
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆19Jun 12, 2024Updated 2 years ago
liuhongjiang / blog_code
View on GitHub
Code for my own blog
☆10Nov 7, 2013Updated 12 years ago
yongzhuo / qwen2-sft
View on GitHub
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆73May 17, 2024Updated 2 years ago
MaYufei-NPU / InfoGain-RAG
View on GitHub
Implementation of EMNLP Oral Paper: InfoGain-RAG: Boosting Retrieval-Augmented Generation through Document Information Gain-based Reranki…
☆18Sep 17, 2025Updated 10 months ago
lv2020 / EBM
View on GitHub
LBSN based on foursquare dataset
☆14Apr 26, 2019Updated 7 years ago
gmuggs / pol
View on GitHub
Patterns-of-Life simulation
☆17Jul 23, 2023Updated 3 years ago
SilenceSengoku / IsolationFroest2
View on GitHub
upload a new programing，something like
☆27Nov 30, 2019Updated 6 years ago
atultiwari / LLaVA-Med
View on GitHub
Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.
☆10Nov 29, 2023Updated 2 years ago
byronBBL / Context-DPO
View on GitHub
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆23Feb 17, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zruiii / QwenAudioSFT
View on GitHub
The repoduction codes for Qwen-Audio Fine-tuning
☆55Feb 28, 2026Updated 5 months ago
eliasgoldsztejn95 / PTDRL
View on GitHub
Hospital simulator with pedestrians and robot
☆15Oct 20, 2024Updated last year
owenliang / nano-graphrag
View on GitHub
A simple, easy-to-hack GraphRAG implementation
☆15Sep 21, 2024Updated last year
ThisIsHwang / EXIT
View on GitHub
Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."
☆25Jul 15, 2026Updated 2 weeks ago
zzmylq / MMPOI
View on GitHub
☆16Nov 12, 2025Updated 8 months ago
WisdomShell / ADG
View on GitHub
[ACL'26 Main Conference] Instruction Data Selection via Answer Divergence
☆22Apr 14, 2026Updated 3 months ago
YanJieWen / OD-STGCN
View on GitHub
Spatiotemporal OD prediction based on traffic zones
☆13May 26, 2025Updated last year