Bruce-Lee-LY / cuda_hgemv

Several optimization methods for half-precision general matrix-vector multiplication (HGEMV) using CUDA cores.
49 · Updated 2 months ago

Related projects

Alternatives and complementary repositories for cuda_hgemv