google-research / byt5

☆514

Alternatives and similar repositories for byt5

Users who are interested in byt5 are comparing it to the libraries listed below.

GEM-benchmark / NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆787 Updated last year
alexa / bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆473 Updated 3 years ago
microsoft / fastformers
FastFormers - highly efficient transformer models for NLU
☆705 Updated 4 months ago
google-research / xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆645 Updated 2 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆330 Updated last year
Ki6an / fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
☆585 Updated 2 years ago
tunib-ai / parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
☆790 Updated 2 years ago
microsoft / fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433 Updated 2 years ago
IntelLabs / academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
☆314 Updated last year
facebookresearch / vizseq
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
☆445 Updated last month
abelriboulot / onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
☆255 Updated 2 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆397 Updated 3 years ago
princeton-nlp / DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆604 Updated 3 years ago
google-research / multilingual-t5
☆1,279 Updated 2 years ago
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆361 Updated 3 years ago
Lightning-Universe / lightning-transformers
Flexible components pairing 🤗 Transformers with Pytorch Lightning
☆609 Updated 2 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆310 Updated 5 years ago
bhoov / exbert
A Visual Analysis Tool to Explore Learned Representations in Transformers Models
☆597 Updated last year
facebookresearch / GENRE
Autoregressive Entity Retrieval
☆793 Updated 2 years ago
google / seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
☆583 Updated last week
facebookresearch / KILT
Library for Knowledge Intensive Language Tasks
☆953 Updated 3 years ago
google-research-datasets / ToTTo
ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…
☆453 Updated 10 months ago
microsoft / MPNet
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆294 Updated 3 years ago
google-research-datasets / paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…
☆559 Updated 3 years ago
helboukkouri / character-bert
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
☆201 Updated last year
allenai / allennlp-models
Officially supported AllenNLP models
☆546 Updated 2 years ago
awslabs / mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
☆344 Updated 2 years ago
google-research / bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆746 Updated 2 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆155 Updated last year
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188 Updated 3 years ago