RadeonFlow / RadeonFlow_Kernels

Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X
50 · Updated last week

Alternatives and similar repositories for RadeonFlow_Kernels

Users who are interested in RadeonFlow_Kernels are comparing it to the libraries listed below.
