tongxuluo / prts

Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training"
41Updated 7 months ago

Alternatives and similar repositories for prts

Users that are interested in prts are comparing it to the libraries listed below

Sorting: