bigscience-workshop/t-zero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bigscience-workshop/t-zero)

bigscience-workshop / t-zero

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

☆463

Alternatives and similar repositories for t-zero

Users that are interested in t-zero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bigscience-workshop / promptsource
View on GitHub
Toolkit for creating, sharing and using natural language prompts.
☆3,027Oct 23, 2023Updated 2 years ago
google-research / FLAN
View on GitHub
☆1,565Jul 2, 2026Updated 2 weeks ago
allenai / natural-instructions
View on GitHub
Expanding natural instructions
☆1,045Dec 11, 2023Updated 2 years ago
bigscience-workshop / xmtf
View on GitHub
Crosslingual Generalization through Multitask Finetuning
☆535Sep 22, 2024Updated last year
r-three / t-few
View on GitHub
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆460Sep 6, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
soheeyang / unified-prompt-selection
View on GitHub
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11Nov 14, 2024Updated last year
yizhongw / Tk-Instruct
View on GitHub
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆184Oct 28, 2022Updated 3 years ago
seonghyeonye / Flipped-Learning
View on GitHub
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆117Jun 28, 2025Updated last year
google / seqio
View on GitHub
Task-based datasets, preprocessing, and evaluation for sequence models.
☆595Jul 2, 2026Updated 2 weeks ago
joeljang / ELM
View on GitHub
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Apr 26, 2023Updated 3 years ago
facebookresearch / MetaICL
View on GitHub
An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi
☆274Apr 15, 2023Updated 3 years ago
google-research / t5x
View on GitHub
☆2,976Jul 9, 2026Updated last week
google / BIG-bench
View on GitHub
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
☆3,248Jul 19, 2024Updated 2 years ago
facebookresearch / FiD
View on GitHub
Fusion-in-Decoder
☆595Oct 4, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CarperAI / trlx
View on GitHub
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752Jan 8, 2024Updated 2 years ago
awebson / prompt_semantics
View on GitHub
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆84May 10, 2022Updated 4 years ago
bigscience-workshop / architecture-objective
View on GitHub
☆100Jul 25, 2023Updated 2 years ago
facebookresearch / KILT
View on GitHub
Library for Knowledge Intensive Language Tasks
☆978Mar 31, 2022Updated 4 years ago
INK-USC / expl-refinement
View on GitHub
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11Oct 25, 2021Updated 4 years ago
bigscience-workshop / bigscience
View on GitHub
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,018Jul 29, 2024Updated last year
timoschick / pet
View on GitHub
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
☆1,625Jun 12, 2023Updated 3 years ago
facebookresearch / metaseq
View on GitHub
Repo for external large-scale work
☆6,549Apr 27, 2024Updated 2 years ago
shmsw25 / Channel-LM-Prompting
View on GitHub
An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"
☆130Apr 23, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
allenai / RL4LMs
View on GitHub
A modular RL library to fine-tune language models to human preferences
☆2,393Mar 1, 2024Updated 2 years ago
google-research / text-to-text-transfer-transformer
View on GitHub
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,539Jul 8, 2026Updated last week
anthropics / hh-rlhf
View on GitHub
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,851Jun 17, 2025Updated last year
KNOT-FIT-BUT / R2-D2
View on GitHub
Official repository of the R2-D2's pipeline
☆21Nov 16, 2021Updated 4 years ago
kernelmachine / demix
View on GitHub
DEMix Layers for Modular Language Modeling
☆54Feb 25, 2026Updated 4 months ago
AkariAsai / ATTEMPT
View on GitHub
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆104Dec 1, 2022Updated 3 years ago
google-research / prompt-tuning
View on GitHub
Original Implementation of Prompt Tuning from Lester, et al, 2021
☆701Mar 6, 2025Updated last year
adapter-hub / adapters
View on GitHub
A Unified Library for Parameter-Efficient and Modular Transfer Learning
☆2,822Apr 26, 2026Updated 2 months ago
allenai / unifiedqa
View on GitHub
UnifiedQA: Crossing Format Boundaries With a Single QA System
☆442May 9, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
martiansideofthemoon / rankgen
View on GitHub
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆140Aug 2, 2023Updated 2 years ago
facebookresearch / GENRE
View on GitHub
Autoregressive Entity Retrieval
☆800Jul 6, 2023Updated 3 years ago
google-research / longt5
View on GitHub
☆183May 26, 2023Updated 3 years ago
bigscience-workshop / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆1,448Mar 20, 2024Updated 2 years ago
bigscience-workshop / evaluation
View on GitHub
Code and Data for Evaluation WG
☆42May 4, 2022Updated 4 years ago
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year
machelreid / m2d2
View on GitHub
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago