tongxuluo / prts
View external linksLinks

Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training"
44Oct 16, 2024Updated last year

Alternatives and similar repositories for prts

Users that are interested in prts are comparing it to the libraries listed below

Sorting:

Are these results useful?