SkyworkAI / MoH

MoH: Multi-Head Attention as Mixture-of-Head Attention
240 · Updated 6 months ago

Alternatives and similar repositories for MoH:

Users who are interested in MoH are comparing it to the libraries listed below.