ofirpress / sandwich_transformer

This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper, "Improving Transformer Models by Reordering their Sublayers".
55 · Updated 4 years ago

Alternatives and similar repositories for sandwich_transformer:

Users who are interested in sandwich_transformer are comparing it to the libraries listed below.