stas00 / portingLinks

Helper scripts and notes that were used while porting various nlp models

☆48

Alternatives and similar repositories for porting

Users that are interested in porting are comparing it to the libraries listed below

Sorting:

huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
google-research / t5x_retrieval
☆101Updated 2 years ago
HendrikStrobelt / LMdiff
A diff tool for language models
☆44Updated last year
cimeister / typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆81Updated 3 years ago
zphang / minimal-opt
☆67Updated 3 years ago
google-research-datasets / seahorse
Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…
☆89Updated last year
leogao2 / lm_dataformat
☆78Updated last year
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 3 years ago
martiansideofthemoon / hurdles-longform-qa
Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a…
☆46Updated 3 years ago
Geotrend-research / smaller-transformers
Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.
☆104Updated 3 years ago
jungokasai / beam_with_patience
☆46Updated 3 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆35Updated 2 years ago
AI21Labs / lm-evaluation
Evaluation suite for large-scale language models.
☆128Updated 4 years ago
lucidrains / marge-pytorch
Implementation of Marge, Pre-training via Paraphrasing, in Pytorch
☆76Updated 4 years ago
martiansideofthemoon / rankgen
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆138Updated 2 years ago
huggingface / model_card
☆30Updated 4 years ago
salesforce / TaiChi
Open source library for few shot NLP
☆78Updated 2 years ago
bloomberg / minilmv2.bb
Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)
☆61Updated 2 years ago
jwieting / paraphrastic-representations-at-scale
☆75Updated 4 years ago
nreimers / se-pytorch-xla
☆21Updated 4 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆156Updated last year
huggingface / tune
☆87Updated 3 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
spyysalo / wiki-bert-pipeline
Generate BERT vocabularies and pretraining examples from Wikipedias
☆17Updated 5 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆136Updated 6 years ago
oriram / spider
☆54Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆27Updated last year
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Updated 2 years ago