Gryphe / BlockMerge_Gradient

Merge Transformers language models by use of gradient parameters.
205 | Updated 6 months ago

Alternatives and similar repositories for BlockMerge_Gradient:

Users who are interested in BlockMerge_Gradient compare it to the libraries listed below.