RobertCsordas / linear_layer_as_attention

The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention".
16 · Updated last year

Alternatives and similar repositories for linear_layer_as_attention:

Users who are interested in linear_layer_as_attention are comparing it to the libraries listed below.