bdusell / stack-attention

Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
17 · Updated 10 months ago

Alternatives and similar repositories for stack-attention:

Users who are interested in stack-attention compare it with the libraries listed below.