SkyworkAI / MoH

MoH: Multi-Head Attention as Mixture-of-Head Attention
191 · Updated 3 months ago

Alternatives and similar repositories for MoH:

Users who are interested in MoH are comparing it to the libraries listed below.