tgale96 / grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
☆51Updated last week
Related projects ⓘ
Alternatives and complementary repositories for grouped_gemm
- ☆79Updated 2 months ago
- High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.☆87Updated 3 months ago
- PyTorch bindings for CUTLASS grouped GEMM.