yifu-ding / BGEMM-CUDA

This repository implements Binary General Matrix Multiply (BGEMM) with a custom CUDA kernel. Thanks to FP6-LLM for the wheels!
10 · Updated 3 weeks ago

Related projects: