sanyalsunny111 / Early_Weight_Avg

[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training
13 · Updated last month

Related projects

Alternatives and complementary repositories for Early_Weight_Avg