A curated list of awesome instruction tuning datasets, models, papers and repositories.
☆345Jun 12, 2023Updated 3 years ago
Alternatives and similar repositories for Awesome-instruction-tuning
Users that are interested in Awesome-instruction-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,148Jan 4, 2024Updated 2 years ago
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆511Apr 4, 2024Updated 2 years ago
- A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。☆735Jun 17, 2026Updated last week
- Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆769Jul 20, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Datasets for Instruction Tuning of Large Language Models☆261Nov 30, 2023Updated 2 years ago
- Code and data for the FACTOR paper☆54Nov 15, 2023Updated 2 years ago
- Butler 是一个用于自动化服务管理和任务调度的工具项目。☆17Updated this week
- ☆60Aug 22, 2024Updated last year
- ☆19Jun 21, 2025Updated last year
- ☆462Jun 9, 2024Updated 2 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- AllenAI's post-training codebase☆3,775Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Oct 6, 2024Updated last year
- Aligning pretrained language models with instruction data generated by themselves.☆4,602Mar 27, 2023Updated 3 years ago
- Instruction Tuning with GPT-4☆4,334Jun 11, 2023Updated 3 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆11Dec 3, 2024Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆13Dec 3, 2023Updated 2 years ago
- Expanding natural instructions☆1,045Dec 11, 2023Updated 2 years ago
- ☆35May 18, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆21Oct 24, 2024Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Dec 27, 2022Updated 3 years ago
- A quick guide (especially) for trending instruction finetuning datasets☆3,396Nov 28, 2023Updated 2 years ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,191May 3, 2025Updated last year
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆16Nov 12, 2024Updated last year
- LLM Zoo collects information of various open- and close-sourced LLMs☆271Aug 23, 2023Updated 2 years ago
- Text Style Transfer: A Review☆13Jun 1, 2019Updated 7 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,921Mar 14, 2024Updated 2 years ago
- Paper List for In-context Learning 🌷☆877Oct 8, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆21Apr 11, 2023Updated 3 years ago
- Robust recipes to align language models with human and AI preferences☆5,623May 26, 2026Updated last month
- ☆1,566Jun 10, 2026Updated 2 weeks ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 4 years ago
- A framework for few-shot evaluation of language models.☆13,106Updated this week
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles☆78Aug 7, 2009Updated 16 years ago
- PsyChat: A Client-Centric Dialogue System for Mental Health Support☆66Sep 4, 2024Updated last year