IBM / model-recyclingLinks

Ranking of fine-tuned HF models as base models.

☆36

Alternatives and similar repositories for model-recycling

Users that are interested in model-recycling are comparing it to the libraries listed below

Sorting:

allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
JoaoLages / RATransformers
RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!
☆41Updated 2 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
nbroad1881 / strideformer
Using short models to classify long texts
☆21Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated 2 years ago
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆33Updated last year
HendrikStrobelt / LMdiff
A diff tool for language models
☆43Updated last year
EleutherAI / semantic-memorization
☆44Updated 9 months ago
nreimers / se-pytorch-xla
☆21Updated 3 years ago
Ankush7890 / ssfinetuning
A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning
☆14Updated 3 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
lucy3 / whos_filtered
☆14Updated 10 months ago
PyThaiNLP / MultiEL
Multilingual Entity Linking model by BELA model
☆12Updated 2 years ago
orevaahia / magnet-tokenization
☆13Updated 8 months ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆27Updated last year
thevasudevgupta / bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49Updated 2 years ago
kaustubhdhole / natural-dont-know
Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries
☆19Updated 3 years ago
stanfordnlp / ColBERT-QA
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
☆41Updated 4 years ago
davidheineman / thresh
🌾 Universal, customizable and deployable fine-grained evaluation for text generation.
☆23Updated last year
peterbhase / SLAG-Belief-Updating
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28Updated 3 years ago
jungokasai / beam_with_patience
☆46Updated 3 years ago
guilhermemr04 / scaling-zero-shot-retrieval
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
☆29Updated 2 years ago
UKPLab / on-emergence
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Updated 7 months ago
cimeister / typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆82Updated 3 years ago
facebookresearch / UNIREX
This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).
☆26Updated 2 years ago
MilaNLProc / language-invariant-properties
☆22Updated 3 years ago
terrierteam / pyterrier_doc2query
☆37Updated 8 months ago
oriram / spider
☆54Updated 2 years ago