lmsdss / LayerNorm-Scaling
[NeurIPS 2025] Official PyTorch implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin, Yefeng Zheng, Shiwei Liu
70 · Mar 3, 2026 · Updated 3 weeks ago

Alternatives and similar repositories for LayerNorm-Scaling

Users who are interested in LayerNorm-Scaling are comparing it to the libraries listed below.
