ROCm / AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
12 | Jun 24, 2024 | Updated last year

Alternatives and similar repositories for AITemplate

Users that are interested in AITemplate are comparing it to the libraries listed below
