Gryphe / BlockMerge_Gradient

Merge Transformers language models by use of gradient parameters.
202Updated 3 months ago

Related projects

Alternatives and complementary repositories for BlockMerge_Gradient