allenai/longformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/longformer)

allenai / longformer

Longformer: The Long-Document Transformer

☆2,197

Alternatives and similar repositories for longformer

Users that are interested in longformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / reformer-pytorch
View on GitHub
Reformer, the efficient Transformer, in Pytorch
☆2,191Jun 21, 2023Updated 3 years ago
google-research / text-to-text-transfer-transformer
View on GitHub
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,533Jul 2, 2026Updated last week
google-research / electra
View on GitHub
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,366Mar 23, 2024Updated 2 years ago
google-research / bigbird
View on GitHub
Transformers for Longer Sequences
☆633Sep 1, 2022Updated 3 years ago
laiguokun / Funnel-Transformer
View on GitHub
☆220Jun 8, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zihangdai / xlnet
View on GitHub
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,181May 28, 2023Updated 3 years ago
facebookresearch / XLM
View on GitHub
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,927Feb 14, 2023Updated 3 years ago
huggingface / sentence-transformers
View on GitHub
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,900Updated this week
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,237Sep 30, 2025Updated 9 months ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
allenai / allennlp
View on GitHub
An open-source NLP research library, built on PyTorch.
☆11,886Nov 22, 2022Updated 3 years ago
allenai / dont-stop-pretraining
View on GitHub
Code associated with the Don't Stop Pretraining ACL 2020 paper
☆543Nov 15, 2021Updated 4 years ago
facebookresearch / KILT
View on GitHub
Library for Knowledge Intensive Language Tasks
☆978Mar 31, 2022Updated 4 years ago
facebookresearch / SpanBERT
View on GitHub
Code for using and evaluating SpanBERT.
☆907Jul 25, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
idiap / fast-transformers
View on GitHub
Pytorch library for fast transformer implementations
☆1,772Mar 23, 2023Updated 3 years ago
princeton-nlp / SimCSE
View on GitHub
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,651Oct 16, 2024Updated last year
facebookresearch / adaptive-span
View on GitHub
Transformer training code for sequential tasks
☆610Sep 14, 2021Updated 4 years ago
makcedward / nlpaug
View on GitHub
Data augmentation for NLP
☆4,662Jun 20, 2026Updated 3 weeks ago
SCHENLIU / longformer-chinese
View on GitHub
chinese version of longformer
☆116Nov 6, 2020Updated 5 years ago
timoschick / pet
View on GitHub
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
☆1,624Jun 12, 2023Updated 3 years ago
microsoft / DeBERTa
View on GitHub
The implementation of DeBERTa
☆2,240Sep 29, 2023Updated 2 years ago
marcotcr / checklist
View on GitHub
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
☆2,049Jan 9, 2024Updated 2 years ago
facebookresearch / LAMA
View on GitHub
LAnguage Model Analysis
☆1,391Jul 7, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
microsoft / fastformers
View on GitHub
FastFormers - highly efficient transformer models for NLU
☆706Mar 21, 2025Updated last year
kimiyoung / transformer-xl
View on GitHub
☆3,703Sep 21, 2022Updated 3 years ago
google / sentencepiece
View on GitHub
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,950Jul 2, 2026Updated last week
jessevig / bertviz
View on GitHub
BertViz: Visualize Attention in Transformer Models
☆8,113Jan 8, 2026Updated 6 months ago
nyu-mll / jiant
View on GitHub
jiant is an nlp toolkit
☆1,675Jul 6, 2023Updated 3 years ago
facebookresearch / DPR
View on GitHub
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
☆1,867Apr 6, 2023Updated 3 years ago
openai / sparse_attention
View on GitHub
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
☆1,616Aug 12, 2020Updated 5 years ago
google-research / language
View on GitHub
Shared repository for open-sourced projects from the Google AI Language team.
☆1,784Jun 10, 2026Updated last month
mit-han-lab / lite-transformer
View on GitHub
[ICLR 2020] Lite Transformer with Long-Short Range Attention
☆609Jul 11, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sacmehta / delight
View on GitHub
DeLighT: Very Deep and Light-Weight Transformers
☆469Oct 16, 2020Updated 5 years ago
google-research / long-range-arena
View on GitHub
Long Range Arena for Benchmarking Efficient Transformers
☆788Dec 16, 2023Updated 2 years ago
harvardnlp / pytorch-struct
View on GitHub
Fast, general, and tested differentiable structured prediction in PyTorch
☆1,132Apr 20, 2022Updated 4 years ago
appvision-ai / fast-bert
View on GitHub
Super easy library for BERT based NLP models
☆1,918Aug 19, 2024Updated last year
clovaai / length-adaptive-transformer
View on GitHub
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102Nov 2, 2020Updated 5 years ago
google-research / xtreme
View on GitHub
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651Jan 4, 2023Updated 3 years ago
deepset-ai / FARM
View on GitHub
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
☆1,750Dec 20, 2023Updated 2 years ago