NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including support for 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada, and Blackwell GPUs, providing better performance and lower memory utilization in both training and inference.
2,954 · Updated this week

Alternatives and similar repositories for TransformerEngine

Users who are interested in TransformerEngine compare it to the libraries listed below.
