ROCm / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
11Updated 10 months ago

Alternatives and similar repositories for AITemplate

Users that are interested in AITemplate are comparing it to the libraries listed below

Sorting: