tlc-pack / cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
82Updated 6 months ago

Related projects: