bdhirsh / pytorch_open_registration_example
Example of using pytorch's open device registration API
☆25Updated last year
Related projects: ⓘ
- ☆34Updated 2 years ago
- An extension library of WMMA API (Tensor Core API)☆81Updated 2 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆82Updated 6 months ago
- ☆38Updated 4 years ago
- ☆73Updated 5 months ago
- Benchmark code for the "Online normalizer calculation for softmax" paper☆52Updated 6 years ago
- Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)☆109Updated 4 years ago
- Benchmark scripts for TVM