thomaschlt / mla.cLinks

Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.
19Updated last year

Alternatives and similar repositories for mla.c

Users that are interested in mla.c are comparing it to the libraries listed below

Sorting: