[NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks
☆77Jun 7, 2024Updated last year
Alternatives and similar repositories for SoftmaxOutputApproximation
Users that are interested in SoftmaxOutputApproximation are comparing it to the libraries listed below
Sorting: