cofe-ai / MSG

Masked Structural Growth for 2x Faster Language Model Pre-training
22 · Updated 6 months ago

Related projects

Alternatives and complementary repositories for MSG