lmsdss / LayerNorm-ScalingView on GitHub
[NeurIPS 2025] Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin,Yefeng Zheng, Shiwei Liu
70Mar 3, 2026Updated last month

Alternatives and similar repositories for LayerNorm-Scaling

Users that are interested in LayerNorm-Scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?