gsarti / lambda-bert
A š¤-style implementation of BERT using lambda layers instead of self-attention
ā70Updated 4 years ago
Related projects ā
Alternatives and complementary repositories for lambda-bert
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.ā145Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer ā¦ā55Updated 3 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorchā75Updated 3 years ago
- LM Pretraining with PyTorch/TPUā132Updated 5 years ago
- ā73Updated 3 years ago
- ā47Updated 4 years ago
- ā63Updated 2 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorchā117Updated 3 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transferā39Updated 4 years ago
- ā64Updated 4 years ago
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://aā¦ā46Updated 2 years ago
- Code and Data for Evaluation WGā41Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021ā33Updated 3 years ago
- ā20Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselinesā132Updated last year
- Implementation of Mixout with PyTorchā74Updated last year
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorchā45Updated 3 years ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)ā36Updated 3 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTaā18Updated 5 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We sā¦ā66Updated last year
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"ā41Updated 3 years ago
- A BART version of an open-domain QA model in a closed-book setupā119Updated 4 years ago
- Hyperparameter Search for AllenNLPā134Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answeringā115Updated 3 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.ā11Updated 3 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.ā81Updated last year
- ā46Updated 4 years ago
- On Generating Extended Summaries of Long Documentsā77Updated 3 years ago
- Fine-tune transformers with pytorch-lightningā44Updated 2 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"ā39Updated 5 years ago