facebookincubator / AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
4,585 · Updated last month

Alternatives and similar repositories for AITemplate:

Users who are interested in AITemplate are comparing it to the libraries listed below.