Bruce-Lee-LY / cuda_back2back_hgemm

Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
11Updated last year

Related projects

Alternatives and complementary repositories for cuda_back2back_hgemm