allenai / longformer
Longformer: The Long-Document Transformer
☆2,094Updated 2 years ago
Alternatives and similar repositories for longformer:
Users that are interested in longformer are comparing it to the libraries listed below
- Transformers for Longer Sequences☆596Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,350Updated last year
- Reformer, the efficient Transformer, in Pytorch☆2,155Updated last year
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,903Updated 2 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,249Updated last year
- Code for using and evaluating SpanBERT.☆896Updated last year
- jiant is an nlp toolkit☆1,663Updated last year
- Pytorch library for fast transformer implementations☆1,687Updated 2 years ago
- The implementation of DeBERTa☆2,058Updated last year
- ☆3,639Updated 2 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,117Updated 2 years ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,624Updated 2 years ago
- BERT-related papers☆2,041Updated last year
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆791Updated 3 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,523Updated 5 months ago
- Code for paper Fine-tune BERT for Extractive Summarization☆1,481Updated 3 years ago
- Data augmentation for NLP☆4,525Updated 9 months ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,141Updated last year
- FastFormers - highly efficient transformer models for NLU☆704Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,227Updated 7 months ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"☆1,628Updated last year
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆640Updated 2 years ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,291Updated 8 months ago
- ☆1,268Updated 2 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,205Updated 5 months ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,114Updated last week
- Entity Linker solution☆1,184Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,100Updated last year
- BERT score for text generation☆1,703Updated 7 months ago
- Collection of papers and resources for data augmentation for NLP.☆829Updated 2 years ago