RadeonFlow / RadeonFlow_KernelsView on GitHub
Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X
75Feb 11, 2026Updated 2 weeks ago

Alternatives and similar repositories for RadeonFlow_Kernels

Users that are interested in RadeonFlow_Kernels are comparing it to the libraries listed below

Sorting:

Are these results useful?