zhilizju/Awesome-instruction-tuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhilizju/Awesome-instruction-tuning)

zhilizju / Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

☆346

Alternatives and similar repositories for Awesome-instruction-tuning

Users that are interested in Awesome-instruction-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yaodongC / awesome-instruction-dataset
View on GitHub
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
☆1,152Jan 4, 2024Updated 2 years ago
RenzeLou / awesome-instruction-learning
View on GitHub
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
☆512Apr 4, 2024Updated 2 years ago
jianzhnie / awesome-instruction-datasets
View on GitHub
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
☆738Jun 17, 2026Updated last month
SinclairCoder / Instruction-Tuning-Papers
View on GitHub
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769Jul 20, 2023Updated 3 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
raunak-agarwal / instruction-datasets
View on GitHub
Datasets for Instruction Tuning of Large Language Models
☆261Nov 30, 2023Updated 2 years ago
AI21Labs / factor
View on GitHub
Code and data for the FACTOR paper
☆54Nov 15, 2023Updated 2 years ago
HelloEveryboby / Butler
View on GitHub
Butler 是一个用于自动化服务管理和任务调度的工具项目。
☆17Updated this week
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
qinyiwei / InfoBench
View on GitHub
☆61Aug 22, 2024Updated last year
XueFuzhao / InstructionWild
View on GitHub
☆462Jun 9, 2024Updated 2 years ago
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
magesh-technovator / awesome-ai-applications
View on GitHub
A Comprehensive survey on business use cases of AI that help them thrive in the digital economy
☆13Oct 7, 2020Updated 5 years ago
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,801Updated this week
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,607Mar 27, 2023Updated 3 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,332Jun 11, 2023Updated 3 years ago
boyiwei / CoTaEval
View on GitHub
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17Jul 17, 2024Updated 2 years ago
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
deadshot465 / novelcrafter-mcp
View on GitHub
An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.
☆11Dec 3, 2024Updated last year
luojie1024 / MossQA-mnbvc
View on GitHub
本项目主要对开源的MOSS SFT数据进行整理，转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面，共353w样本，MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数，共630w样本，
☆13Dec 3, 2023Updated 2 years ago
allenai / natural-instructions
View on GitHub
Expanding natural instructions
☆1,045Dec 11, 2023Updated 2 years ago
scosman / voicebox
View on GitHub
Exploration: using technology to aid people who lack both the ability to speak and fine motor control.
☆21Oct 24, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yeonsw / RankEncoder
View on GitHub
☆35May 18, 2023Updated 3 years ago
Zjh-819 / LLMDataHub
View on GitHub
A quick guide (especially) for trending instruction finetuning datasets
☆3,401Nov 28, 2023Updated 2 years ago
DevSinghSachan / art
View on GitHub
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
☆62Dec 27, 2022Updated 3 years ago
hikariming / chat-dataset-baseline
View on GitHub
人工精调的中文对话数据集和一段chatglm的微调代码
☆1,190May 3, 2025Updated last year
DAMO-NLP-SG / LLM-Zoo
View on GitHub
LLM Zoo collects information of various open- and close-sourced LLMs
☆270Aug 23, 2023Updated 2 years ago
Instruction-Tuning-with-GPT-4 / Instruction-Tuning-with-GPT-4.github.io
View on GitHub
☆21Apr 11, 2023Updated 3 years ago
yyxxrr739 / autosar-rag
View on GitHub
This is a AUTOSAR documents specific retriever based on LLM and RAG.
☆16Nov 12, 2024Updated last year
luofuli / A-Review-of-Text-Style-Transfer
View on GitHub
Text Style Transfer: A Review
☆13Jun 1, 2019Updated 7 years ago
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916Mar 14, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
dqxiu / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆876Oct 8, 2024Updated last year
WeblateOrg / hello
View on GitHub
Hello world demonstration for Weblate
☆15Jan 20, 2026Updated 6 months ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,639May 26, 2026Updated last month
vale-cli / SubVale
View on GitHub
A Sublime Text 3 client for Vale Server.
☆13Dec 7, 2020Updated 5 years ago
pkunlp-icler / MLS
View on GitHub
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13Apr 13, 2022Updated 4 years ago
qiuhuachuan / PsyChat
View on GitHub
PsyChat: A Client-Centric Dialogue System for Mental Health Support
☆66Sep 4, 2024Updated last year
google-research / FLAN
View on GitHub
☆1,565Jul 2, 2026Updated 2 weeks ago