NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
2,086Updated this week

Alternatives and similar repositories for TransformerEngine:

Users that are interested in TransformerEngine are comparing it to the libraries listed below