Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769Jul 20, 2023Updated 2 years ago
Alternatives and similar repositories for Instruction-Tuning-Papers
Users that are interested in Instruction-Tuning-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆511Apr 4, 2024Updated 2 years ago
- Paper List for In-context Learning 🌷☆875Oct 8, 2024Updated last year
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆2,103Oct 5, 2023Updated 2 years ago
- Aligning pretrained language models with instruction data generated by themselves.☆4,600Mar 27, 2023Updated 3 years ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,773Aug 4, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,148Jan 4, 2024Updated 2 years ago
- Expanding natural instructions☆1,044Dec 11, 2023Updated 2 years ago
- Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…☆128Jul 26, 2023Updated 2 years ago
- Must-read papers on prompt-based tuning for pre-trained language models.☆4,301Jul 17, 2023Updated 2 years ago
- [ACL 2023] Reasoning with Language Model Prompting: A Survey☆1,004May 21, 2025Updated 11 months ago
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆569Nov 13, 2023Updated 2 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- Awesome papers on Language-Model-as-a-Service (LMaaS)☆545May 14, 2024Updated 2 years ago
- A curated list of reinforcement learning with human feedback resources (continually updated)☆4,363Dec 9, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A modular RL library to fine-tune language models to human preferences☆2,387Mar 1, 2024Updated 2 years ago
- ☆920Jul 24, 2024Updated last year
- A curated list of awesome instruction tuning datasets, models, papers and repositories.☆347Jun 12, 2023Updated 2 years ago
- Instruction Tuning with GPT-4☆4,336Jun 11, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- ☆1,563May 12, 2026Updated last week
- Paper List for Contrastive Learning for Natural Language Processing☆573Apr 27, 2023Updated 3 years ago
- Toolkit for creating, sharing and using natural language prompts.☆3,015Oct 23, 2023Updated 2 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆272Nov 8, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.☆329Aug 10, 2023Updated 2 years ago
- Paper collection on building and evaluating language model agents via executable language grounding☆365Apr 29, 2024Updated 2 years ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆286Aug 8, 2022Updated 3 years ago
- Datasets for Instruction Tuning of Large Language Models☆260Nov 30, 2023Updated 2 years ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆21,138Updated this week
- Resource, Evaluation and Detection Papers for ChatGPT☆455Mar 21, 2024Updated 2 years ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆553Mar 10, 2024Updated 2 years ago
- A framework for few-shot evaluation of language models.☆12,595May 11, 2026Updated last week
- Aligning Large Language Models with Human: A Survey☆742Sep 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆131Feb 25, 2023Updated 3 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆104Dec 1, 2022Updated 3 years ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓☆3,608Apr 20, 2026Updated 3 weeks ago
- [NIPS2023] RRHF & Wombat☆806Sep 22, 2023Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,923Mar 14, 2024Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 3 years ago