r-three / t-fewView external linksLinks
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆457Sep 6, 2023Updated 2 years ago
Alternatives and similar repositories for t-few
Users that are interested in t-few are comparing it to the libraries listed below
Sorting:
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆465Nov 5, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Aug 2, 2023Updated 2 years ago
- ☆130Aug 18, 2022Updated 3 years ago
- Toolkit for creating, sharing and using natural language prompts.☆2,997Oct 23, 2023Updated 2 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆104Dec 1, 2022Updated 3 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,802Oct 12, 2025Updated 4 months ago
- ☆184May 26, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Original Implementation of Prompt Tuning from Lester, et al, 2021☆698Mar 6, 2025Updated 11 months ago
- ☆1,560Feb 5, 2026Updated last week
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated last year
- ☆290Dec 2, 2022Updated 3 years ago
- SGPT: GPT Sentence Embeddings for Semantic Search☆873Feb 17, 2024Updated last year
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆242Jan 20, 2023Updated 3 years ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆879Oct 30, 2023Updated 2 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year
- Expanding natural instructions☆1,030Dec 11, 2023Updated 2 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 3 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Jun 12, 2023Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆313Nov 21, 2022Updated 3 years ago
- Few-shot Learning of GPT-3☆357Sep 18, 2023Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆183Oct 28, 2022Updated 3 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Apr 24, 2023Updated 2 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 2 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,143Jan 11, 2024Updated 2 years ago
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆543Mar 24, 2022Updated 3 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆271Nov 8, 2022Updated 3 years ago
- Efficient few-shot learning with Sentence Transformers☆2,678Dec 11, 2025Updated 2 months ago
- ☆54Jan 18, 2023Updated 3 years ago
- contrastive decoding☆207Nov 14, 2022Updated 3 years ago
- Fusion-in-Decoder☆591Oct 4, 2023Updated 2 years ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆584Oct 3, 2023Updated 2 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆206Aug 17, 2022Updated 3 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆577Nov 10, 2023Updated 2 years ago
- ☆99Jul 25, 2023Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago