jack57lee / Diversify-MHA

EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Routing-by-Agreement
19 · Updated 4 years ago

Related projects

Alternatives and complementary repositories for Diversify-MHA