A curated list of awesome instruction tuning datasets, models, papers and repositories.
☆347Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-instruction-tuning
Users who are interested in Awesome-instruction-tuning are comparing it to the libraries listed below.
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)☆1,146Jan 4, 2024Updated 2 years ago
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆509Apr 4, 2024Updated 2 years ago
- A collection of awesome-prompt-datasets and awesome-instruction-dataset resources, gathering a wide variety of instruction datasets for training ChatLLM models such as ChatGPT.☆728Apr 7, 2024Updated 2 years ago
- Reading list for instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆767Jul 20, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Datasets for Instruction Tuning of Large Language Models☆261Nov 30, 2023Updated 2 years ago
- Code and data for the FACTOR paper☆53Nov 15, 2023Updated 2 years ago
- Butler is a tool project for automated service management and task scheduling.☆16Updated this week
- ☆59Aug 22, 2024Updated last year
- ☆19Jun 21, 2025Updated 9 months ago
- ☆462Jun 9, 2024Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- A comprehensive survey of AI use cases that help businesses thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- AllenAI's post-training codebase☆3,677Updated this week
- ☆27Oct 6, 2024Updated last year
- Aligning pretrained language models with instruction data generated by themselves.☆4,588Mar 27, 2023Updated 3 years ago
- Instruction Tuning with GPT-4☆4,335Jun 11, 2023Updated 2 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Expanding natural instructions☆1,039Dec 11, 2023Updated 2 years ago
- Hello world demonstration for Weblate☆14Jan 20, 2026Updated 2 months ago
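- This project organizes the open-source MOSS SFT data and converts it into the mnbvc multi-turn dialogue format. MOSS-003 covers three dimensions (helpfulness, faithfulness, and harmlessness) with 3.53 million samples in total; MOSS-003 includes finer-grained helpfulness category labels, broader harmlessness data, and longer dialogues, with 6.3 million samples in total.☆12Dec 3, 2023Updated 2 years ago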
- ☆35May 18, 2023Updated 2 years ago
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆22Oct 24, 2024Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Dec 27, 2022Updated 3 years ago
- A quick guide (especially) for trending instruction finetuning datasets☆3,379Nov 28, 2023Updated 2 years ago
- A manually refined Chinese dialogue dataset and a piece of ChatGLM fine-tuning code☆1,194May 3, 2025Updated 11 months ago
- A retriever specific to AUTOSAR documents, based on LLMs and RAG.☆16Nov 12, 2024Updated last year
- LLM Zoo collects information on various open- and closed-source LLMs☆271Aug 23, 2023Updated 2 years ago
- Text Style Transfer: A Review☆13Jun 1, 2019Updated 6 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,934Mar 14, 2024Updated 2 years ago
- Paper List for In-context Learning 🌷☆872Oct 8, 2024Updated last year
- ☆22Apr 11, 2023Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,551Updated this week
- ☆1,561Mar 25, 2026Updated last week
- A framework for few-shot evaluation of language models.☆12,020Updated this week
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- Useful collection of webrat Textmate snippets meant for use with the RSpec Story and/or Cucumber bundles☆79Aug 7, 2009Updated 16 years ago