tlc-pack / cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
85Updated 8 months ago

Related projects

Alternatives and complementary repositories for cutlass_fpA_intB_gemm