xiamengzhou / training_trajectory_analysis

[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
22Updated 10 months ago

Related projects: