thomaschlt / mla.c

Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.
15Updated 2 months ago

Alternatives and similar repositories for mla.c:

Users that are interested in mla.c are comparing it to the libraries listed below