gonglinyuan / metro_t0Links

Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)

☆22

Alternatives and similar repositories for metro_t0

Users that are interested in metro_t0 are comparing it to the libraries listed below

Sorting:

argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
mungg / FABLES
☆57Updated 10 months ago
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated last week
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year
Zyphra / Zyda_processing
☆37Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 6 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆21Updated last month
salesforce / simplification
☆22Updated 6 months ago
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆15Updated last year
kaistAI / factual-knowledge-acquisition
☆21Updated 3 months ago
allenai / bff
☆39Updated last year
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
kaistAI / InstructIR
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated 2 years ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32Updated 3 months ago
hadasah / btm
☆75Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133Updated 6 months ago
ltgoslo / bert-in-context
Official implementation of "BERTs are Generative In-Context Learners"
☆31Updated 4 months ago
bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆40Updated 3 weeks ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
EleutherAI / semantic-memorization
☆44Updated 8 months ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated 10 months ago